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PREFACE 


SPONSORSHIP  OF  THE  WORK 

This  volume  was  prepared  by  the  Statistical  Research  Group,  Divi- 
sion of  War  Research,  Columbia  University,  for  the  Applied  Mathematics 
Panel,  National  Defense  Research  Committee,  Office  of  Scientific 
Research  and  Development.  The  work  was  initiated  under  contract 
OEMsr-618  and  completed  under  contract  OEMsr-1131,  both  between 
the  Trustees  of  Columbia  University  and  the  Office  of  Scientific  Research 
and  Development.  Royalties,  at  usual  rates,  are  being  paid  by  the 
publisher  to  the  United  States  government ;  no  royalties  are  being  paid  to 
Columbia  University  or  to  any  private  institution  or  individual.  The 
publisher  has  agreed  that  ten  years  after  the  date  on  which  this  volume  is 
issued  the  copyright  thereon  shall  be  relinquished  and  the  work  shall 
become  part  of  the  public  domain. 

The  Statistical  Research  Group  was  organized  and  its  initial  staff 
selected  by  Warren  Weaver,  Chief  of  the  Applied  Mathematics 
Panel,  on  July  1,  1942.  The  staff  was  drawn  from  universities  and 
research  organizations  throughout  the  country,  many  of  which  generously 
granted  leaves  for  the  work.  The  group  disbanded  September  30,  1945, 
except  for  a  few  persons  who  have  prepared  this  and  another  volume  as 
monographs  in  connection  Avith  the  Applied  Mathematics  Panel's  Sum- 
mary Technical  Report. 

NATURE  OF  THE  BOOK 

This  book  discusses  a  series  of  problems  that  occur  frequently  in 
planning,  analyzing,  or  interpreting  quantitative  data.  Various  tech- 
niques appropriate  to  these  problems  are  explained,  in  terms  of  both 
general  principles  and  specific  procedures. 

The  book  is  written  for  scientists,  engineers,  managers,  and  statis- 
ticians. It  seeks  to  explain  the  statistical  approach  to  various  problems, 
to  expound  the  theory  that  can  be  brought  to  bear  on  each  problem,  to 
point  out  the  features  of  the  problem  that  determine  the  relevance  of  the 
theory,  to  give  detailed  and  workable  computational  procedures,  includ- 
ing formulas,  tables,  and  charts,  and  to  illustrate  the  interpretation  of 
the  results.  Concrete  illustrative  material  is  used  to  bring  out  the 
interplay  of  substantive  and  statistical  considerations,  the  illustrations 
being  selected  primarily  for  the  light  they  throw  on  the  art  of  applying 
statistical  techniques  and  only  secondarily  for  intrinsic  interest.  Although 
relatively  little  new  statistical  theory  is  included,  several  topics  are  given 
more  thoroughly  integrated  and  carefully  expounded  treatment  than  has 
previously  been  available,  and  many  new  tables  are  included. 
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In  preparing  this  book,  the  authors  have  kept  in  mind  the  reader  who 
is  interested  in  applying  statistical  techniques  in  his  work  but  who  either 
does  not  have  much  interest  in  or  time  for  studying  general  statistical 
theory  as  such,  or  else  feels  that  he  can  best  understand  and  appreciate 
statistical  theory  through  a  series  of  special  problems.  Even  the  statis- 
tical theorist,  however,  should  be  able  to  deepen  and  broaden  his  under- 
standing and  appreciation  of  statistical  theory  by  examining  special 
problems  in  some  detail. 

The  chemical,  physical,  and  engineering  sciences  are,  on  the  whole, 
distinctly  backward  in  the  statistical  planning  of  experiments  and  the 
analysis  of  empirical  data,  at  least  by  comparison  with  the  social,  biologi- 
cal, and  agricultural  sciences.  No  doubt  this  is  partly  because  physical 
scientists  have  attained  a  high  degree  of  proficiency  in  experimental 
techniques  and  can  make  repeated,  independent  experimental  investiga- 
tions of  a  given  phenomenon — they  can  " think  with  their  hands" — so 
that  competent  investigators  are  likely  to  learn  the  truth  (though  perhaps 
belatedly  and  at  excessive  cost)  despite  crudities  in  statistical  techniques. 
In  the  social  and  biological  sciences,  on  the  other  hand,  repetition  or 
control  of  experimental  procedures  is  usually  impossible,  at  least  in  any- 
thing like  the  degree  achieved  in  the  physical  sciences,  so  that  the  truth 
can  be  learned  only  through  highly  developed  statistical  techniques 
closely  coupled  (as  statistical  techniques  must  always  be)  with  sound 
scientific  knowledge  and  techniques.  But  even  though  sound  statistical 
techniques  may  not  be  indispensable  in  the  physical  sciences,  they  are 
invaluable.  Increasing  recognition  of  this,  accelerated  by  developments 
during  the  war,  is  leading  to  the  rapid  introduction  of  statistical  methods 
into  chemistry,  physics,  and  engineering;  indeed,  it  seems  likely  that  in 
the  immediate  future  the  greatest  contributions  of  statistics  will  be  to, 
and  its  greatest  developments  will  arise  from,  the  physical  sciences. 
(The  physical  sciences  may,  however,  yield  this  distinction  to  medicine, 
which  is  said  to  be  nearly  as  dependent  on  statistics  as  the  social  sciences 
and  nearly  as  backward  statistically  as  the  physical  sciences.) 

BACKGROUND 

During  the  war  the  Statistical  Research  Group  carried  on  three  kinds 
of  work:  (1)  It  advised  and  assisted  the  Army,  the  Navy,  and  the  Office 
of  Scientific  Research  and  Development  on  the  statistical  aspects  of 
problems  arising  in  their  activities;  this  constituted  the  largest  part  of 
the  work.  (2)  It  assumed  primary  responsibility  for  the  investigation  of 
certain  problems  of  a  predominantly  statistical  or  probabilistic  character. 
(3)  It  developed,  reviewed,  or  expounded  statistical  techniques;  this  was 
the  smallest  part  of  the  work  and  was  incidental  to  the  first  two. 
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The  great  bulk  of  the  work  fell  into  the  first  two  categories,  and  the 
final  products  did  not  emphasize  statistical  techniques  as  such.  From 
this  work,  however,  there  arose  a  number  of  developments  in  applied 
statistics,  as  well  as  some  in  theoretical  statistics.  This  book  has  been 
prepared  to  preserve  and  to  disseminate  useful  statistical  material  that 
was  almost  a  by-product  of  the  responsible  and  cooperative  handling  of 
practical  military,  scientific,  engineering,  and  production  problems. 

AUTHORSHIP 

Because  it  would  be  difficult  or  impossible  accurately  to  assign  per- 
sonal credit  and  responsibility  for  this  volume,  its  authorship  is  attributed 
to  the  Statistical  Research  Group  as  a  whole.  Under  ordinary  circum- 
stances the  basic  material  of  the  various  chapters  would  have  been 
published  in  technical  journals  by  the  originators  individually.  Under 
the  actual  circumstances,  many  ideas  are  presented  here  for  the  first 
time  in  chapters  prepared  by  others  than  those  who  did  the  basic  work. 
It  thus  becomes  important  to  assign  credit  for  the  ideas  in  such  chapters 
without  at  the  same  time  implying  responsibility  for  the  entire  content 
or  emphasis.  An  attempt  has  been  made  to  do  this  by  notes  at  the 
beginning  of  the  chapters.  The  editors,  and  in  particular  the  under- 
signed, however,  have  exercised  final  judgment  and  authority  over  the 
form  and  content  of  each  chapter  and  are  thus  in  the  unhappy  position 
of  having  to  assume  all  responsibility  for  the  shortcomings  of  the  book 
without  being  able  to  claim  credit  for  its  merits. 

The  following  were  members  of  the  senior  scientific  staff  of  the  Statistical 
Research  Group,  and  the  influence  of  each  is  reflected  in  the  book. 

Kenneth  J.  Arnold,  University  of  Wisconsin 
Rollin  F.  Bennett,  Columbia  University 
Julian  H.  Bigelow,  Institute  for  Advanced  Study 
Albert  H.  Bowker,  Stanford  University 
Churchill  Eisenhart,  National  Bureau  of  Standards 
(on  leave  from  University  of  Wisconsin) 
H.  A.  Freeman,  Massachusetts  Institute  of  Technology 
Milton  Friedman,  University  of  Chicago 
M.  A.  Girshick,  U.  S.  Bureau  of  the  Census 
Millard  W.  Hastay,  National  Bureau  of  Economic  Research 
Harold  Hotelling,  University  of  North  Carolina 
Edward  Paulson,  University  of  North  Carolina 
L.  J.  Savage,  University  of  Chicago 
Herbert  Solomon,  Stanford  University 
George  J.  Stigler,  Brown  University 
A.  Wald,  Columbia  University 
W.  Allen  Wallis,  University  of  Chicago 
J.  Wolf o wit z,  Columbia  University 
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CHAPTER  1 

USE  OF  VARIABLES  IN  ACCEPTANCE  INSPECTION 
FOR  PERCENT  DEFECTIVE 

1.  ATTRIBUTES  AND  VARIABLES 

Acceptance  inspection  is  the  process  of  judging  whether  a  lot  of  items 
is  of  acceptable  quality  by  examining  a  sample  from  the  lot.  Such  inspec- 
tion is  widely  and  successfully  used  in  industry.  It  is  usually  carried  out 
by  classifying  each  item  in  the  sample  as  either  defective  or  nondefective, 
then  deciding  from  the  number  of  defective  items  in  the  sample  whether 
the  percentage  of  defective  items  in  the  lot  is  small  enough  for  the  lot  to 
be  considered  acceptable.  Thus,  the  quality  of  the  lot  is  measured  by  the  f 
percentage  of  defective  items,  and  this  is  judged  by  classifying  the  sample 
items  according  to  attributes. 

Often  the  sample  items  are  classified  as  defective  or  nondefective 
on  the  basis  of  a  measurable  quality  characteristic.  For  example,  the 
items  may  be  classified  as  too  heavy  or  not,  although  the  actual  weight 
could  be  measured ;  they  may  be  classified  as  too  short  or  not,  although  the 
length  could  be  measured;  they  may  be  classified  as  having  an  excessive 
volatile  content  or  not,  although  the  volatile  content  could  be  measured; 
they  may  be  classified  as  operating  too  fast  or  not,  although  the  speed 
could  be  measured ;  they  may  be  classified  as  having  too  short  a  useful  life 
or  not,  although  the  useful  life  could  be  measured.  Such  cases,  where 
inspection  is  based  on  attributes,  may  be  contrasted  with  cases  in  which 
inspection  is  based  on  variables,  that  is,  on  the  actual  measured  values  of 
the  characteristic. 

Inspection  by  variables  has  one  major  advantage  over  inspection  by 
attributes :  fewer  items  need  be  inspected  for  a  given  degree  of  accuracy 
in  judging  a  lot's  acceptability;  or,  conversely,  decisions  about  the  lot 
are  more  reliable  if  based  on  a  sample  inspected  by  variables  than  if 
based  on  a  sample  of  the  same  size  inspected  by  attributes.  The  reason 
for  this  is  that  variables  inspection  provides  more  information  about 
the  quality  of  each  item;  for  example,  that  the  volatile  content  is  16.02 
percent  or  20.61  percent,  as  the  case  may  be,  instead  of  merely  that  the 
volatile  content  exceeds,  say,  16  percent  in  both  cases.  Since  variables 
inspection  provides  more  information  about  each  item,  a  smaller  number  of 
sample  items  gives  the  same  amount  of  information  about  the  lot,  or  the 
same  number  of  sample  items  gives  more  information  about  the  lot. 
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The  principal  advantage  of  inspection  by  attributes  over  inspection 
by  variables  is  that  it  requires  less  skill,  less  time,  or  less  expensive  equip- 
ment to  inspect  each  item;  less  record-keeping  to  record  the  results  of 
inspection;  less  arithmetic  to  determine  from  the  sample  results  whether 
the  lot  should  be  judged  acceptable.  Usually  this  advantage  of  simplicity 
more  than  offsets  the  fact  that  more  items  must  be  inspected  under 
attribute  than  under  variables  inspection. 

Another  advantage  of  attribute  inspection  is  that  the  selection  and 
installation  of  sound  sampling  plans  is  easier.  In  fact,  complete  and 
detailed  procedures  are  available  in  such  form  that  little  or  no  previous 
knowledge  of  technical  statistics  is  required  for  their  successful  use.*  In 
attribute  inspection  the  mathematical  theory  about  the  manner  in  which 
quality  varies  from  one  item  to  another  is  always  fulfilled  if  the  sample  is 
selected  at  random  from  the  lot.  The  mathematical  theory  underlying 
variables  inspection  plans,  on  the  other  hand,  is  that  quality  varies  among 
the  items  of  the  lot  in  accordance  with  a  normal  (Gaussian)  distribution; 
whether  this  theory  is  a  sufficiently  good  approximation  to  the  facts  in  a 
particular  inspection  problem  is  a  matter  requiring  investigation  by 
statistical  methods. 

There  are  other  less  important  reasons  for  preferring  attributes  to 
variables.  One  of  these  is  a  misapprehension  as  to  the  amount  of  arith- 
metic required,  often  based  in  large  part  on  ignorance  of  short-cut 
computational  procedures.  Another  such  reason  is  exaggeration  of  the 
limitations  imposed  by  the  normality  assumption  in  the  mathematics 
underlying  variables  inspection. 

Attributes  are  sometimes  used  instead  of  variables  because  well-known 
sampling  plans  for  variables  measure  lot  quality  by  the  average  or  the 
variability  of  the  measurements,  while  well-known  attribute  plans  meas- 
ure it  by  the  percent  defective.  Percent  defective  is  often  the  appropriate 
measure,  and  its  appropriateness  is  not  affected  by  shifting  between 
attributes  and  variables.  For  example,  in  inspecting  rods  for  length,  the 
percent  too  short  may  in  some  circumstances  be  a  better  measure  of 
quality  than  either  the  mean  or  the  variability  of  length — as  when 
rods  below  a  certain  length  are  useless  while  those  above  that  length  can 
easily  be  trimmed  at  virtually  no  extra  cost.  Similarly,  in  determining 
whether  individual  packages  in  a  lot  provide  full  measure,  the  percent 
below  the  specified  measure  may  be  more  significant  than  the  average 
measure.  Variables  plans  for  percent  defective  exist  but  are  not  widely 
known,  f 

*  See,  for  example,  Statistical  Research  Group,  Sampling  Inspection;  Dodge  and 
Romig,  Sampling  Inspection  Tables  (for  full  citations  see  Sec.  13). 

f  See  especially  the  following  two  papers,  of  which  this  chapter  is  primarily  a 
development:  Jennett  and  Welch,  "The  Control  of  Proportion  Defective";  Johnson 
and  Welch,  "  Applications  of  the  Non-central  t  Distribution." 
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Although  in  most  cases  attribute  plans  are  undoubtedly  more  economi- 
cal than  variables  plans  for  percent  defective,  because  the  lower  cost  of 
inspecting  each  item  usually  more  than  offsets  the  increase  in  the  number 
of  items  that  must  be  inspected,  variables  inspection  offers  decided 
advantages  in  far  more  instances  than  has  usually  been  recognized.  This 
is  especially  true  when  the  inspection  process  is  costly,  as  for  example 
when  it  requires  expensive  time,  equipment,  or  personnel,  or  when  the 
items  are  expensive  and  are  damaged  or  destroyed  by  inspection.  It  is 
also  true  when  measurement  by  variables  costs  only  a  little  more  than 
classification  by  attributes,  or  when  for  purposes  other  than  acceptance 
inspection  the  data  on  variables  are  more  valuable  than  those  on  attrib- 
utes. Thus  it  is  desirable  that  the  practical  aspects  of  variables  plans  be 
more  fully  developed  and  the  plans  made  more  generally  available  than 
has  heretofore  been  the  case. 

2.  BACKGROUND  OF  THE  PLAN 

A  variables  plan  of  acceptance  inspection  for  percent  defective  is  based 
upon  the  sample  mean  and  the  sample  standard  deviation.  Other  meas- 
ures of  central  tendency  and  variability  could  be  used,  for  example,  the 
median  and  the  range.  No  other  measures  are  considered  in  this  chapter, 
however,  because  other  measures  all  require  larger  samples  to  give  as  good 
results  as  the  mean  and  standard  deviation.  The  gains  in  computational 
simplicity  that  may  be  afforded  by  other  measures  are  likely  to  be  unim- 
portant in  the  situations  for  which  variables  inspection  is  appropriate. 
Although  in  routine  inspection  operations  it  is  not  necessary  to  compute 
either  the  mean  or  the  standard  deviation,  it  will  clarify  the  ideas  to  begin 
the  discussion  in  terms  of  them. 

A  normal  distribution,  such  as  that  shown  in  Fig.  1.1,  is  characterized 
by  two  quantities  (parameters),  its  mean  /i  and  its  standard  deviation  a. 
Normal  distributions  have  large  or  small  means  according  as  /i  is  large  or 
small ;  the  observations  are  tightly  clustered  or  widely  scattered  about  the 
mean  according  as  a  is  small  or  large.  The  proportion  of  the  observations 
exceeding  a  certain  upper  limit,  say  U,  depends  upon  how  far  the  limit  is 
above  the  mean,  measured  in  terms  of  the  standard  deviation.  For 
example,  if  the  limit  exceeds  the  mean  by  twice  the  standard  deviation, 
that  is,  if  U  =  n  +  2a,  2.275  percent  of  the  observations  exceed  the  limit; 
but  if  the  limit  exceeds  the  mean  by  three  times  the  standard  deviation, 
that  is,  if  U  =  m  +  3<r,  only  0.135  percent  of  the  observations  exceed  the 
limit.  To  determine  the  percent  of  observations  exceeding  any  limit,  first 
subtract  the  mean  from  the  limit  and  then  divide  the  difference  by  the 
standard  deviation;  that  is,  compute 

K  =  V^  (1) 
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.which  shows  the  difference  between  the  limit  and  the  mean  as  a  multiple  of 
the  standard  deviation.  Various  tables  show  for  different  values  of  K 
what  part  of  a  normal  distribution  exceeds  K.  * 


Normal  distribution. 


From  formula  (1)  it  is  clear  that  the  proportion  exceeding  a  given 
limit  U,  which  we  will  call  the  fraction  defective,  depends  upon  both  the 
mean  and  the  standard  deviation.  For  a  given  standard  deviation,  the 
fraction  defective  is  smaller  the  smaller  the  mean;  but,  however  small 
the  mean,  the  standard  deviation  may  be  large  enough  to  cause  a  high 

*  See,  for  example,  Table  1.1  in  this  chapter.  By  far  the  most  extensive  table  is 
that  prepared  by  the  New  York  Mathematical  Tables  Project,  Tables  of  Probability 
Functions^  This  gives  probabilities  to  15  decimal  places  for  values  of  K  (denoted  by 
x  in  the  tables)  from  0  to  1  by  steps  of  0.0001,  and  for  all  values  of  K  above  1  by  steps 
of  0.001. 

Some  care  is  required  in  using  tables  of  the  normal  probability  function,  since 
two  different  kinds  of  table  are  common  and  several  methods  of  tabulation  are  used. 
The  two  kinds  of  table  differ  in  the  units  used  to  measure  the  difference  between  the 
limit  and  the  mean,  the  most  common  type  (illustrated  by  Vol.  II  of  the  NYMTP 
tables)  measuring  this  as  a  multiple  of  the  standard  deviation  and  the  other  type 
(illustrated  by  Vol.  I  of  the  NYMTP  tables)  measuring  it  as  a  multiple  of  y/2  times 
the  standard  deviation.  Thus,  to  enter  the  second  type  of  table,  K  computed  from 
formula  (1)  must  be  divided  by  -y/2  =  1.414.  In  either  type  of  table  the  entries  may 
represent  (a)  the  proportion  exceeding  K,  which  is  the  same  as  the  proportion  below 
—  K;  (6)  the  proportion  below  K,  which  is  the  same  as  the  proportion  above  —  K; 
(c)  the  proportion  between  the  mean  and  K,  which  is  the  same  as  the  proportion 
between  the  mean  and  —  K;  (a1)  the  proportion  between  —  K  and  K;  (e)  the  proportion 
below  —K  and  above  -\-K.  Given  any  one  of  these  five  values,  the  others  are  easily 
computed  from  the  facts  that  the  distribution  is  symmetrical  about  the  mean  and 
that  the  total  proportion  is  one. 
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fraction  defective.  For  a  fixed  mean  less  than  the  limit,  the  fraction 
defective  is  smaller  the  smaller  the  standard  deviation;  but,  however 
small  the  standard  deviation,  the  mean  may  be  large  enough  to  cause  a 
high  fraction  defective.  The  acceptability  of  any  mean,  when  quality 
is  measured  by  fraction  defective,  depends  on  the  standard  deviation;  and 
the  acceptability  of  any  standard  deviation  depends  on  the  mean. 

Figure  1.2  illustrates  various  combinations  of  the  mean  and  standard 
deviation  that  correspond  to  the  same  lot  quality,  as  measured  by  fraction 


0  U  a 

Fig.  1.2. — Combinations  of  /x  and  <r  corresponding  with  the  same  percent  defective. 

defective.  Any  combination  corresponding  to  a  point  on  the  sloping 
line  represents  the  same  quality;  combinations  to  the  left  of  the  line 
represent  better  quality  (lower  fraction  defective) ,  those  to  the  right  worse 
quality.  Any  lot  quality  can  be  represented  by  such  a  line,  the  percent 
defective  being  greater  the  steeper  the  line.*  Thus  a  lot  is  accept- 
able, in  the  sense  that  it  has  a  fraction  defective  less  than  or  equal 
to  a  specified  acceptable  quality  level  (AQL),  whenever  (U  —  ii)/a  is 
greater  than  or  equal  to  a  certain  value  of  K  which  depends  on  the  AQL 
and  can  be  determined  from  tables  of  the  normal  probability  function. 
In  other  words,  the  lot  is  acceptable  if  its  mean  plus  a  certain  multiple  of 
its  standard  deviation  does  not  exceed  the  limit;  tfhat  is,  if 

M  +  Ka  <  U  (2) 

*  Lot  qualities  worse  than  50  percent  defective  would  be  represented  by  a  line 
starting  at  ju  =  U,  a  = 0  and  sloping  upward  to  the  right.  When  the  mean  exceeds 
the  limit  the  percent  defective  exceeds  50,  and  for  a  given  mean  the  percent  defective 
is  smaller  the  larger  the  standard  deviation.  Such  cases  are  rarely  of  interest  in 
acceptance  sampling  but  can  be  treated  by  formulas  (1'),  (2'),  and  (3')  below. 
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The  discussion  so  far  has  been  entirely  in  terms  of  the  lot  mean  and 
standard  deviation.  In  practice,  the  lot  mean  and  standard  deviation, 
li  and  a,  are  unknown,  and  it  is  necessa^  to  judge  from  the  sample  mean 
and  standard  deviation,  x  and  s,  whether  they  satisfy  criterion  (2) .  This 
can  be  done  by  adding  a  multiple  of  the  sample  standard  deviation  to  the 
sample  mean  and  comparing  the  sum  with  the  limit.  In  choosing  the 
multiple  of  the  sample  standard  deviation,  however,  allowance  must  be 
made  for  the  fact  that  the  mean  and  standard  deviation  determined  from 
the  sample  are  subject  to  fluctuations  from  sample  to  sample.  That  is,  if 
a  number  of  samples  are  drawn  from  lots  having  exactly  the  same  fraction 
defective,  the  samples  will  not  all  yield  the  same  value  of  (U  —  x)/s. 
Thus  the  acceptable  level  of  (U  —  x)/s,  say  k,  must  be  small  enough  so 
that  there  will  not  be  too  high  a  probability  that  an  acceptable  lot  will 
fail  to  meet  the  acceptance  criterion  for  the  sample,  namely, 

x  +  ks  <  U  (3) 

If  the  sample  contains  measurements  on  N  items  (that  is,  contains 
N  observations)  represented  by  xh  x2,   *   •   ■  xN,  the  sample  mean  x  is 

Xi  +  X2  +    •    •    •     +  Xn  Sx?:  ... 

~ir.    — =-jf  (4) 

the  sample  variance  s2  is 

2        (xi  —  x)2  +  (xt  —  x)2  +    •   •   •    +  (xN  —  x)2        X(Xi  —  x)2         . 

s  = w^t~  —  =  ~A^=n~   (5j 

and  the  sample  standard  deviation  5  is  the  square  root  of  the  sample 
variance. 

A  few  miscellaneous  remarks  will  complete  the  background  of  vari- 
ables inspection  plans  for  percent  defective: 

The  symbols  m  and  <r  are  used  to  indicate  the  population  mean  and 
standard  deviation;  x  and  s  are  used  to  represent  the  sample  mean  and 
standard  deviation.  Similarly,  K  is  used  to  represent  the  deviation 
of  the  lot  mean  from  the  limit  in  units  of  the  lot  standard  deviation,  and 
k  is  used  to  represent  the  deviation  of  the  sample  mean  from  the  limit  in 
units  of  the  sample  standard  deviation.  This  is  in  conformity  with 
the  spund  and  increasingly  prevalent  practice  of  using  notation  that 
emphasizes  the  distinction  between  true  lot  (population  or  universe) 
parameters  and  sample  estimates  of  these  parameters — an  estimate  mean- 
ing in  statistical  usage  a  value  calculated  from  a  sample  (not  guessed  or 
judged  in  the  everyday  sense  of  " estimate")  according  to  established 
principles  for  estimating  lot  parameters  from  sample  data.  Neglect  of 
the  distinction  between  lot  parameters  and  sample  estimates  of  them 
can  be  (and  often  is)  a  cause  of  serious  confusion.     The  quantities  x  and 
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s,  which  for  brevity  have  been  called  the  sample  mean  and  standard 
deviation,  should  properly  be  called  the  sample  estimates  of  the  lot  mean 
and  standard  deviation. 

The  use  of  N  —  1  instead  of  N  in  the  denominator  of  the  variance 
[see  formula  (5)]  has  no  real  influence  on  the  sampling  plan  provided  the  k 
of  expression  (3)  is  chosen  with  due  regard  to  whether  N  or  N  —  1  is  used. 
If  N  is  used,  the  proper  value  of  k  is  greater  by  a  factor  \/N/(N  —  1) 
than  if  N  —  1  is  used,  so  k  times  s  is  the  same  in  either  case.  If,  however, 
the  value  of  5  is  itself  of  interest,  the  use  of  N  —  1  is  preferable. 

Although  formula  (5)  is  useful  for  bringing  out  the  meaning  of  the 
standard  deviation,  it  should  not  be  used  for  computing  purposes.  For 
computing,  use  one  of  the  following  two  forms : 


(6a) 

N(N  -  1)  (66) 

For  purposes  of  acceptance  inspection,  however,  it  is  not  necessary  (as 
will  be  seen  later)  actually  to  compute  the  standard  deviation. 

For  concreteness  it  is  assumed  that  defective  items  are  those  for  which 
the  value  of  the  quality  characteristic  exceeds  some  upper  bound  of 
satisfactory  quality  U.  It  would  be  equally  appropriate  to  assume  that 
defective  items  are  those  below  some  minimum  M.  In  that  case,  equa- 
tions (1),  (2),  and  (3)  would  be 

K  =  <L=JL  (1/) 

a 
n  -  K<r>  M  (2') 

x  -  ks>  M  (3') 

In  what  follows,  the  assumption  is  retained  that  defectives  are  items  that 
are  too  large,  but  from  equations  (1'),  (2'),  and  (3;)  the  application  to 
cases  where  defectives  are  items  that  are  too  small  will  be  obvious. 

3.  THREE  PHASES  OF  A  VARIABLES  SAMPLING  PLAN 
3.1.  The  Three  Phases 

A  sampling  procedure  may  be  analyzed  into  three  phases: 

(1)  The  plan  of  action,  that  is,  the  set  of  rules  on  the  basis  of  which 
to  accept  or  reject  the  lot.* 

(2)  The  amount  of  inspection  required  b3'  the  plan,  that  is,  the  number 
of  items  that  must  be  inspected  from  each  lot. 

*  Some  plans  allow  a  third  decision,  to  inspect  more  items;  see  Sec.  11.6. 
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(3)  The  operating  characteristics  of  the  plan,  that  is,  the  proportions 
of  submitted  lots  of  various  qualities  that  will  be  accepted  and  rejected 
if  the  plan  is  used. 

3.2.  Plan  of  Action 

The  plan  of  action  for  variables  inspection  for  percent  defective  is  as 
follows : 

a.  A  sample  of  N  items  is  selected  at  random  from  the  lot. 

b.  The  quality  characteristic  being  inspected  is  measured  and  recorded 
for  each  item  of  the  sample. 

c.  The  mean  and  standard  deviation  are  computed  from  the  N 
measurements. 

d.  The  sample  standard  deviation  is  multiplied  by  a  certain  constant 
k  and  added  to  the  sample  mean. 

e.  The  lot  is  rejected  if  the  value  of  x  +  ks  found  in  step  d  exceeds  U, 
the  limit  of  the  quality  characteristic  above  which  an  item  is  considered 
defective;  otherwise,  the  lot  is  accepted. 

In  practice,  steps  c  and  d  can  be  replaced  by  a  single  step  that  consists 
essentially  of  adding  the  squared  values  of  the  N  sample  measurements; 
in  that  case  step  e  remains  essentially  as  stated,  though  the  actual 
numbers  used  in  it  are  different. 

Thus,  the  plan  of  action  is  characterized  by  two  parameters:  the 
sample  size  N  and  the  multiplier  of  the  standard  deviation  k.  (U  is 
associated  with  the  definition  of  quality  rather  than  with  the  sampling 
plan,  since  it  simply  defines  a  defective  item.) 

3.3.  Amount  of  Inspection 

Under  the  plans  considered  here,  which  are  single-sampling  plans,  the 
number  of  items  inspected  per  lot  is  always  the  same  for  a  given  plan. 

Single-sampling  plans  for  fraction  defective  based  on  attributes  can 
sometimes  be  curtailed,  that  is,  terminated  without  inspecting  all  N 
items.  Curtailing  is  possible  as  soon  as  the  number  of  defectives  reaches 
the  rejection  number  or  the  number  of  defectives  plus  the  number  of 
items  remaining  to  be  inspected  is  less  than  the  rejection  number.  Plans 
based  on  variables  can  also  be  curtailed,  provided  there  is  some  minimum 
possible  value  of  the  characteristic  being  inspected.  Most  acceptance 
inspection  involves  quantities  that  can  under  no  circumstances  be  less 
than  zero,  so  curtailing  before  inspecting  all  N  items  is  technically  pos- 
sible. The  simplest  example  of  this  possibility  is  the  case  in  which,  with 
k  positive,  the  sum  of  fewer  than  N  measurements  exceeds  NU,  making 
it  clear  that  for  the  entire  TV  items  x  will  exceed  U  and  x  +  ks  will  exceed 
it  still  further.  That  there  are  other  cases,  less  extreme  but  more  compli- 
cated to  describe,  in  which  curtailing  is  possible  can  be  seen  by  studying 
Sec.    11.3    below.     Ordinarily,    however,    curtailing   is    not    considered 
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because  computations  are  not  started  until  all  measurements  have  been 
made.  When  it  is  possible  to  carry  out  simple  computations  while  the 
data  are  being  collected  and  it  is  important  to  minimize  the  amount  of 
inspection,  sequential  analysis  of  the  data  will  be  considerably  more 
efficient  than  merely  curtailing  a  single-sampling  plan.  A  brief  outline 
of  a  plan  for  sequential  analysis  of  fraction  defective  by  variables  is 
presented  in  Sec.  11.6. 

3.4.  Operating  Characteristics  (OC) 

There  are  evidently  many  possible  plans  of  action,  for  each  different 
combination  of  sample  size  and  multiplier  of  the  standard  deviation 

Lp 


1007c 


Fig.  1.3. 


100% 

-Typical  operating-characteristic  (OC)  curve. 


gives  a  different  plan  of  action.  If  a  large  number  of  lots  containing  a 
given  fraction  defective  p  are  submitted  to  a  given  plan,  a  certain  pro- 
portion Lp  are  accepted;  and  this  proportion  differs  from  one  plan  to 
another.  For  each  plan  of  action,  it  is  important  to  know  what  propor- 
tion of  submitted  lots  will  be  accepted  for  each  possible  quality  p  of 
submitted  lots,  as  a  plan  is  clearly  unsuitable  if  it  passes  too  many  of  the 
lots  of  unsatisfactory  quality  or  rejects  too  many  of  the  lots  of  acceptable 
quality  that  are  submitted  to  it. 

This  information  is  shown  by  the  operating-characteristic  (OC)  curve 
of  the  plan.  The  ordinate  of  the  OC  curve  shows  the  proportion  Lp  of 
lots  that  will  be  accepted  when  the  fraction  defective  in  submitted  lots 
is  p.  The  proportion  of  submitted  lots  of  quality  p  that  will  be  rejected 
is  1  —  Lp.  The  OC  curve  of  Fig.  1.3  is  typical:  it  shows  a  probability  of 
1  of  accepting  a  lot  having  no  defectives.     As  the  fraction  defective 
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increases,  the  probability  of  acceptance  declines,  slowly  at  first,  then  more 
rapidly,  and  finally  flattens  out.  It  shows  a  probability  of  0  of  accepting 
a  lot  having  only  defectives. 

It  should  be  emphasized  that  the  OC  curve  does  not  show  the  proba- 
bility that  an  accepted  lot  will  be  of  quality  p;  this  depends  upon  the 
probability  that  a  lot  of  quality  p  will  be  submitted,  as  well  as  upon  the 
probability  that  a  lot  of  quality  p  will  be  accepted  if  submitted.  The  OC 
curve  shows  only  the  probability  that  a  lot  of  quality  p  will  be  accepted 
if  submitted. 

The  OC  curve  may  be  thought  of  in  another  way,  which,  though  it 
requires  visualizing  a  three-dimensional  chart,  is  illuminating.  Consider 
Fig.  1.2,  in  which  the  lot  mean  jjl  is  plotted  on  the  horizontal  axis  and 
the  lot  standard  deviation  a  on  the  vertical  axis.  Any  point  in  this 
figure  for  which  a  >  0  represents  a  possible  lot.  Imagine  a  line  perpen- 
dicular to  the  plane  of  the  page  through  some  point  ii,a.  The  height  of 
the  perpendicular  represents  the  probability  LM,ff  that,  if  a  lot  whose  mean 
and  standard  deviation  correspond  to  the  point  /jl,ct  is  submitted,  the  plan 
of  action  will  accept  it.  If  an  OC  surface  is  constructed  by  erecting  such 
perpendiculars  at  all  possible  combinations  of  /u,  and  a,  it  will  be  found 
that  above  any  straight  line  in  the  \x-c  plane  passing  through  U  (as  for 
example,  the  sloping  line  in  Fig.  1.2)  all  the  values  of  !>,„  are  equal. 
vSince  all  lots  represented  by  a  straight  line  through  U  have  the  same 
value  of  K,  this  means  that  all  lots  having  the  same  value  of  K  have  the 
same  probability  of  acceptance.  Hence  the  OC  surface  can  be  reduced 
to  an  OC  curve  in  two  dimensions,  the  probability  of  acceptance  being 
plotted  against  the  value  of  K.  But,  as  was  pointed  out  in  Sec.  2,  all 
lots  having  the  same  value  of  K  have  the  same  fraction  defective.  Thus 
the  OC  curve  can  show  the  probability  of  acceptance  as  a  function  of  p, 
as  in  Fig.  1.3. 

4.  PLAN  OF  ACTION 
4.1.  General  Procedure 

The  first  step  in  selecting  the  values  of  k  and  N  that  determine  a 
particular  plan  is  to  select  two  points  through  which  it  is  desired  that  the 
OC  curve  pass.  The  first  point  corresponds  to  a  lot  fraction  defective  pi 
for  which  it  is  desired  that  the  probability  of  rejection  be  not  greater 
than  a.  The  second  point  corresponds  to  a  lot  fraction  defective  p2 
(?>2  >  Pi)  for  which  it  is  desired  that  the  probability  of  acceptance  be 
only  /?  (a  +  /?  <  1).  Fractions  defective  of  pi  or  less  thus  represent 
acceptable  quality;  a  is  the  maximum  producer's  risk,  that  is,  the  maxi- 
mum risk  that  an  acceptable  lot  will  be  rejected  by  the  sampling  plan. 
Similarly,  fractions  defective  of  p2  or  more  represent  unacceptable  qual- 
ity; ft  is  the  maximum  consumer's  risk,  that  is,  the  maximum  risk  that 
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an  unacceptable  lot  will  be  accepted  by  the  sampling  plan.     For  lots  of 
quality  better  than  pi  (fractions  defective  less  than  pi)  the  producer's 
risk  will  be  less  than  a,  and  for  lots  of  quality  worse  than  p2  (fractions 
defective  more  than  p2)  the  consumer's  risk  will  be  less  than  /?. 
The  two  points  on  the  OC  curve  thus  correspond  with 

p  =  px        LPl  =  1  —  a  (7) 

p  =  p2         LP2  =  (3  (8) 

From  these,  k  and  N  can  be  computed.  If  N  proves  too  large,  it  is 
necessary  to  select  two  points  on  a  less  discriminating  OC  curve.  The 
more  discriminating,  that  is,  the  steeper,  the  OC  curve,  the  larger  N; 
consequently,  N  may  be  reduced  by  widening  the  gap  between  pi  and  p2 
or  by  increasing  the  sum  of  the  risks  a  -\-  j3,  either  of  which  results  in 
a  natter  OC  curve. 

4.2.  Formulas  for  k  and  N 

Let  Kt,  where  0  <  e  <  1,  represent  a  normal  deviate  exceeded  with 
probability  e;  that  is,  Ke  is  defined  by 

1       C  °°    —^1 

?si.*"r*-'  (9) 

Values  of  K €  corresponding  with  various  common  values  of  e  are  given  in 
Table  1.1.  Four  values  of  e,  namely,  ph  p2,  a,  and  /3,  are  of  particular 
interest;  and  the  corresponding  values  of  K€  may  be  designated  Kh  K2, 
Ka,  and  Kp. 

The  value  of  k  may  be  computed  from  the  formula 

7        KaK2  +  KpKi  .     N 

k  =       Ka  +  Kf  (10) 

The  sample  size  can  be  determined  either  from 


or  from 


2(Ka  +  KpY  +  (KaK2  +  KftKtf 

2{K^  -  K2y  K     } 


*^&m 


The  value  of  JV  given  by  (11)  and  (12)  ordinarily  will  not  be  an  integer; 
it  should  be  rounded  to  the  next  higher  integer,  not  to  the  nearest  integer.  * 

Note  that  k  is  a  weighted  average  of  Ki  and  K2,  the  weights  being 
Kp  and  Ka,  respectively.  Thus,  if  both  a  and  (3  are  Jess  than  0.5,  k  is 
between  Ki  and  K2;  and  if  a  and  (3  are  equal,  k  =  (K\  -j-  K2)/2. 

*  Two  more  formulas  for  N  are  given  by  equations  (134)  and  (135),  Sec.  11.1.2. 
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TABLE  1.1 
Normal  Deviates  Exceeded  with  Given  Probability 


Values  of  Ke  for  various  e,  where 


V2 


=  f 


2  dx  =  £ 


e 

0.00 

0.0 

0.1 

0.2 

0.3 

0.4 

0 

00 

00 

1.28155 

0.84162 

0.52440 

0.25335 

1 

3.09023 

2.32635 

1.22653 

0.80642 

0.49585 

0 . 22754 

2 

2.87816 

2.05375 

1 . 17499 

0.77219 

0.46770 

0.20189 

3 

2.74778 

1.88079 

1 . 12639 

0.73885 

0.43991 

0.17637 

4 

2 . 65207 

1.75069 

1.08032 

0.70630 

0.41246 

0.15097 

5 

2.57583 

1.64485 

1.03643 

0 . 67449 

0.38532 

0.12566 

6 

2.51214 

1.55477 

0.99446 

0 . 64335 

0.35846 

0 . 10043 

7 

2.45726 

1.47579 

0.95417 

0.61281 

0  33185 

0.07527 

8 

2.40892 

1.40507 

0.91537 

0 . 58284 

0.30548 

0.05015 

9 

2.36562 

1.34076 

0.87790 

0.55338 

0.27932 

0.02507 

When  e  >  0.5,  Ke  =  -#i_£.     When  e  =  0.5,  K0.5  =  0. 

The  entries  in  column  (e)  are  additional  digits  for  the  values  of  e  shown  in  the  top 
row.  Thus,  the  entry  1.12639  in  column  (0.1)  and  row  (3)  corresponds  with  e  =  0.13; 
similarly,  the  entry  2.74778  in  column  (0  00)  and  row  (3)  corresponds  with  e  =  0.003. 

A  final  5  in  italic  indicates  that,  in  rounding,  the  nearest  odd  digit  should  be  used 
in  the  fourth  decimal  place.  When  a  final  5  is  not  in  italic,  follow  the  usual  rule  of 
rounding  0.5  to  the  nearest  even  integer.  Thus  a  final  pair  55  becomes  6,  35  becomes 
3,  85  becomes  9. 

For  a  detailed  table  see  The  Kelley  Statistical  Tables;  Table  I  gives  values  of  Ke 
(denoted  by  x)  to  8  decimal  places  for  values  of  e  (denoted  by  q)  from  0  to  0.5  by 
steps  of  0.0001.  Alternatively,  see  Fisher  and  Yates,  Statistical  Tables;  Table  I  gives 
values  of  Ke  (denoted  by  x)  to  6  decimal  places  for  values  of  e  (denoted  by  P/2)  from 
0  to  0.495  by  steps  of  0.005;  also  Table  IX  gives  values  of  Ke  -j-  5  to  4  decimal  places 
for  values  of  1  -  e  from  0  to  0.980  by  steps  of  0.001  and  from  0.9800  to  0  9999  by 
steps  of  0.0001. 

4.3.  Illustration 

To  illustrate  the  use  of  equations  (10),  (11),  and  (12),  suppose  that 
lots  are  considered  acceptable  if  the  percent  defective  is  15  or  less  and 
unacceptable  if  the  percent  defective  is  30  or  more,  and  suppose  the 
maximum  producer's  and  consumer's  risks  are  to  be  1  percent  and  2  per- 
cent, respectively.     Then 


pi  =  0.15 
7>2  =  0.30 


a  =  0.01 
p  =  0.02 


From  Table  1.1,  using  only  4  decimal  places,  we  find 


K2 


1.0364 
0.5244 


Ka  =  2.3263 
K*  =  2.0537 


(13) 
(14) 


(15) 
(16) 
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Substituting  from  (15)  and  (16)  into  (10)  gives 

2.3263  X  0.5244  +  2.0537  X  1.0364 

2.3263  +  2.0537  U  } 

_  3.3484 
"  4.3800 
=  0.7645 

Substituting  from  (15)  and  (16)  into  (11)  gives 

=  2  X  4.38002  +  3.34842 

2  X  0.51202  {     } 

=  94.6 

The  same  value  of  N  results  from  substituting  from  (15),  (16),  and  (17) 
into  (12) 

/  a    oonr\\  2 

2~    '  \0M2()J 
=  1.2922  X  73.1829 
=  94.6. 


N  _  0.7845'  +  2  /^SOOY 


Thus,  in  this  case  a  sample  of  95  items  would  be  drawn  at  random 
from  the  lot  and  a  measurement  of  the  characteristic  being  inspected 
would  be  made  on  each  item  of  the  sample.  The  sum  of  the  measure- 
ments and  the  sum  of  their  squares  would  be  computed.  These  sums 
would  be  substituted  into  equations  (4)  and  (6)  to  find  the  sample  mean 
and  standard  deviation,  x  and  s.  Finally,  x  +  0.7645s  would  be  com- 
puted; if  it  exceeded  U  (the  value  of  the  quality  characteristic  above 
which  an  item  is  considered  defective)  the  lot  would  be  rejected,  other- 
wise the  lot  would  be  accepted.  (Actually,  the  abbreviated  computa- 
tional procedures  described  in  Sec.  7  might  be  used.) 

4.4.  Adjustment  of  N 

If  the  sample  size  given  by  (11)  and  (12)  is  too  large,  two  new  points 
may  be  selected  in  such  a  way  that  the  OC  curve  will  be  less  steep.  For 
example,  if  p\  and  p2  are  kept  at  15  and  30  percent  but  a  and  /3  are 
doubled,  to  2  and  4  percent,  N  is  reduced  to  72;  or  if  a  and  /3  are  kept  at 
1  and  2  percent  but  pi  and  p2  are  changed  to  10  and  35  percent,  N  becomes 
32. 

4.5.  Adjustment  of  k  with  N  Fixed 

4.5.1.  Formula. — If  for  some  reason  N  is  definitely  fixed,  the  OC  curve 
can  be  made  to  pass  through  any  one  point  desired,  say 

p  =  pi         LPI  =  1  -  a  (20) 

by  setting 

k  =  Kl±  VW^Tb  (21) 
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where  the  plus  sign  is  to  be  used  if  a  >  0.5  and  the  minus  sign  if  a  <  0.5, 
and  where 

a=1-2(A^T)  (22) 


and 


6  =  K\  -  f  (23) 


4.5.2.  Example. — Suppose,  for  example,  that  a  sample  of  95  has  been 
decided  upon,  and  a  sampling  plan  is  desired  such  that  the  OC  curve 
passes  through  the  point  defined  by  (14).     Then  (20)  becomes 

pi  =  0.30        LPi  =  0.02  (24) 

and  from  Table  1.1  we  find 

K,  =  0.5244         Ka  =  -2.0537  (25) 

Hence 

=  0.9776 

4  9177 
b  =  0.2750  -  ^—^-  (27) 

=  0.2306 

ab  =  0.2254  (28) 

K\  -  ab  =  0.2750  -  0.2254  (29) 

=  0.0496 
VK{  -  ab  =  0.2227  (30) 

_  0.5244  +  0.2227  .         ' 

~  0.9776  {6i) 

=  0.7642 

This  differs  slightly  from  the  value  0.7645  yielded  by  (17)  for  a  sample 
of  95  and  an  OC  curve  passing  through  the  points  defined  by  (13)  and 
(14).  The  value  of  k  yielded  by  (31)  would  be  equal  (except  for  differ- 
ences due  to  rounding)  to  that  yielded  by  (17)  if  in  (22)  and  (23)  both  N 
and  N  —  1  were  taken  as  94.6. 

4.5.3.  A  Special  Case  :  Indifference  Quality. — If  N  is  fixed  and  k  is  to 
be  selected  so  that  the  OC  curve  will  pass  through  one  specified  point, 
this  point  might  reasonably  be  chosen  at  the  indifference  quality  p0. 
The  indifference  quality  is  the  quality  at  which  it  is  a  matter  of  complete 
indifference  whether  the  lot  is  accepted  or  rejected.*     Lots  of  better 

*  For  a  discussion  of  the  indifference  quality  as  a  key  to  the  selection  of  sampling- 
inspection  plans,  see  Statistical  Research  Group,  Sampling  Inspection,  Chap.  15,  Sec. 
2.2.1.3,  p.  145. 


Sec.  4.7  VARIABLES  21 

quality  should  be  accepted  more  often  than  rejected,  lots  of  worse  quality 
should  be  rejected  more  often  than  accepted,  and  lots  of  the  indifference 
quality  should  be  accepted  and  rejected  equally  frequently.  Thus,  a  in 
(20)  becomes  0.5,  Ka  becomes  0,  and  (21)  reduces  to  k  =  K0,  the  normal 
deviate  exceeded  with  probability  equal  to  the  indifference  quality. 

If  k  is  determined  from  the  indifference  quality,  it  does  not  depend 
upon  N,  for  the  term  in  (22)  and  the  term  in  (23)  that  involve  N  drop 
out  when  Ka  =  0.  Thus  if  k  is  determined  from  the  indifference  quality, 
changes  in  N  will  not  change  the  value  of  p  for  which  acceptance  and 
rejection  are  equally  likely,  nor  will  they  require  changes  in  k.  Roughly 
speaking,  changes  in  N  affect  the  steepness  of  the  curve,  while  changes  in 
k  affect  its  location  with  respect  to  the  quality  scale. 

4.6.  Approximations  in  the  Plan 

Three  kinds  of  approximation  are  involved  in  using  formulas  (10), 
(11),  and  (12).  First,  there  is  the  approximation  involved  in  assuming 
the  quality  characteristic  to  be  normally  distributed.  Second,  certain 
mathematical  approximations  are  involved  in  the  derivation  of  the 
formulas  (see  Sec.  11.2).  Third,  the  value  of  N  given  by  formulas  (11) 
and  (12)  is  not  ordinarily  an  integer,  so  an  approximation  is  involved  in 
substituting  an  integer.  The  effect  of  the  first  approximation  is  dis- 
cussed later  (Sec.  9.4).  Even  if  the  quality  characteristic  were  exactly 
normally  distributed,  the  result  of  the  second  and  third  approximations 
would  be  that  the  true  OC  curve  would  not  pass  precisely  through  the 
points  intended;  but  it  would  come  sufficiently  close  for  all  practical 
purposes.  In  the  present  illustration,  for  example,  taking  N  =  95  and 
k  =  0.7645  results  in  a  producer's  risk  of  1  percent  at  a  percent  defective 
of  15.04  instead  of  15.  The  inaccuracies  due  to  the  second  and  third 
approximations  tend  to  be  larger  for  smaller  samples,  both  because  the 
effect  of  the  mathematical  approximations  is  larger  and  because  the 
effect  of  rounding  N  is,  on  the  average,  larger;  this  point  is  discussed 
further  in  Sees.  4.7  and  11.2. 

4.7.  Table  of  N  and  h 

Table  1.2  contains  values  of  N  and  k  for  151  combinations  of  px  and  p2. 
The  values  of  pi  in  the  table  range  from  0.001  to  0.05;  the  values  of  p2 
vary  from  about  %px  or  %px  up  to  10pi  or  more.  The  values  of  a  are 
in  all  cases  0.05  and  the  values  of  /3  are  0. 10.  The  values  of  N  and  k  have 
been  computed  by  methods  explained  in  Sec.  11.5  that  do  not  involve 
the  mathematical  approximations  used  in  formulas  (10)  and  (11)  or  (12), 
so  they  may  be  slightly  different  from,  and  more  accurate  than,  the 
values  given  by  (10)  and  (11)  or  (12).  This  is  especially  true  when  N 
is  small,  not  only  because  of  the  mathematical  approximations  under- 
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TABLE  1.2 
Values  of  N  and  k  for  Various  Combinations  of  pi  and  p2  when  a  =  0.05  and 

jS  =  0,10 

N  is  the  sample  size;  k  is  the  multiple  of  the  standard  deviation  in  the  criterion 
x  +  ks  <  U  for  accepting  the  lot,  where  U  is  the  value  above  which  the  characteristic 
being  inspected  is  regarded  as  defective;  px  is  the  acceptable  quality  level  (AQL); 
p2  is  the  lot  tolerance  percent  defective  (LTPD);  a  =  0.05  is  the  producer's  risk;  and 
/3  =  0.10  is  the  consumer's  risk.  The  true  pi  and  true  p2  are  the  values  at  which,  for 
the  corresponding  N  and  k,  a  =  0.05  and  13  =  0.10;  the  difference  between  the  true 
and  intended  values  of  pi  and  p2  arises  from  the  fact  that  N  must  be  an  integer. 

This  table  has  been  computed  with  the  aid  of  Table  IV  in  Johnson  and  Welch, 
" Applications  of  the  Non-central  ^-Distribution."  The  entries  for  N,  k,  true  ph  and 
true  p2  are  therefore  slightly  more  accurate  than  those  given  by  formulas  (10),  (11) 
or  (12),  and  (42),  respectively.     See  Sec.  11.5  on  the  construction  of  this  table. 


Intended 

Intended 
p2 

N 

k 

True 
Pi 

True 

P2 

0.001 

0.0015 

3177 

3.0214 

0.0010 

0.0015 

0.002 

1032 

2.9711 

0.0010 

0.0020 

0 . 0025 

567 

2.9311 

0.0010 

0  0025 

0.003 

381 

2.8979 

0.0010 

0.0030 

0.004 

226 

2.8441 

0.0010 

0.0040 

0.005 

161 

2.8018 

0.0010 

0.0050 

0.006 

125 

2.7660 

0.0010 

0.0061 

0.007 

103 

2.7359 

0.0010 

0.0071 

0.008 

87 

2.7153 

0.0010 

0.0080 

0.009 

76 

2 . 6832 

0.0010 

0.0092 

0.01 

68 

2.6725 

0.0010 

0.0099 

0.012 

56 

2 . 6220 

0.0010 

0.0122 

0.015 

45 

2 . 5730 

0.0010 

0.0153 

0.02 

34 

2  5208 

0  0010 

0.0201 

0.025 

28 

2.4712 

0.0010 

0.0250 

0.03 

24 

2.4392 

0  0010 

0.0294 

0.035 

21 

2 . 3930 

0.0010 

0.0351 

0.04 

19 

2.3745 

0.0010 

0.0389 

0.05 

16 

2 . 3059 

0.0010 

0.0494 

0.06 

14 

2 . 2570 

0.0010 

0 . 0590 

0.0025 

0.004 

1677 

2.7199 

0 . 0025 

0 . 0040 

0.005 

736 

2 . 6772 

0 . 0025 

0.0050 

0.006 

443 

2.6413 

0.0025 

0.0060 

0.0075 

268 

2.5967 

0.0025 

0.0075 

0.01 

158 

2 . 5376 

0.0025 

0.0101 

, 

0.012 

118 

2.4979 

0.0025 

0.0121 

0.015 

86 

2  4576 

0.0025 

0.0149 

0.02 

59 

2.3875 

0.0025 

0.0202 

0.025 

46 

2.3410 

0.0025 

0.0249 

0.03 

37 

2.2981 

0 . 0025 

0.0301 

0.035 

32 

2.2610 

0.0025 

0.0348 

0.04 

28 

2.2354 

0.0025 

0.0391 

0.05 

22 

2.1724 

0 . 0025 

0.0501 

0.06 

19 

2.1321 

0.0025 

0.0585 
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Intended 
Pi 

Intended 
pi 

N 

k 

True 
Pi 

True 

0.005 

0.0075 

1713 

2.4952 

0.0050 

0.0075 

0.01 

547 

2.4357 

0 . 0050 

0.0100 

0.012 

328 

2 . 3967 

0.0050 

0.0120 

0.015 

197 

2 . 3482 

0.0050 

0.0151 

0.02 

114 

2 . 2824 

0.0050 

0 . 0202 

0.025 

80 

2.2297 

0.0051 

0.0253 

0.03 

61 

2.1886 

0.0050 

0.0302 

0.035 

50 

2.1546 

0.0050 

0 . 0349 

0.04 

42 

2 . 1272 

0.0049 

0.0395 

0.05 

32 

2.0723 

0.0049 

0.0492 

0.06 

26 

2 . 0265 

0.0049 

0.0587 

0.07 

22 

1 . 9874 

0.0049 

0.0680 

0.0075 

0.01 

2907 

2.3728 

0.0075 

0.0100 

0.012 

1039 

2.3339 

0.0075 

0.0120 

0.015 

451 

2.2851 

0.0075 

0.0150 

0.02 

208 

2.2198 

0.0075 

0.0201 

0.025 

130 

2.1700 

0.0075 

0.0250 

0.03 

93 

2.1274 

0.0075 

0.0299 

0.035 

72 

2.0891 

0.0075 

0.0350 

0.04 

58 

2.0555 

0.0075 

0.0401 

0.05 

42 

1.9996 

0.0075 

0.0500 

0.06 

33 

1.9501 

0.0075 

0.0601 

0.07 

27 

1.9093 

0.0075 

0.0700 

0.08 

23 

1.8714 

0.0075 

0.0800 

0.01 

0.015 

1231 

2.2385 

0.0100 

0.0150 

0.02 

389 

2.1733 

0.0100 

0 . 0200 

0.025 

208 

2.1205 

0.0100 

0.0251 

0.03 

137 

2.0761 

0.0101 

0.0302 

0.035 

101 

2.0372 

0.0101 

0.0353 

0.04 

79 

2.0041 

0.0101 

0.0403 

0.045 

64 

1.9776 

0.0100 

0.0451 

0.05 

54 

1.9527 

0.0099 

0 . 0499 

0.06 

41 

1.9007 

0.0100 

0.0602 

0.07 

33 

1.8561 

0.0101 

0.0704 

0.08 

28 

1.8271 

0.0099 

0.0787 

0.09 

24 

1.7974 

0.0098 

0.0880 

0.10 

21 

1.7643 

0.0099 

0.0979 
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TABLE  1.2.- 

—(Continued) 

Intended 
Pi 

Intended 

N 

k 

True 
Pi 

True 

V2 

0.015 

0.02 

2035 

2 . 1047 

0.0150 

0.0200 

0.025 

603 

2.0520 

0.0150 

0 . 0250 

0.03 

310 

2.0077 

0.0150 

0.0301 

0.035 

197 

1 . 9688 

0.0151 

0.0352 

0.04 

141 

1.9348 

0.0151 

0 . 0402 

0.045 

108 

1.9033 

0.0151 

0 . 0453 

0.05 

87 

1.8814 

0.0149 

0 . 0497 

0.06 

61 

1.8271 

0.0151 

0.0604 

0.07 

47 

1.7864 

0.0151 

0.0701 

0.08 

38 

1.7468 

0.0152 

0.0801 

0.09 

32 

1.7223 

0.0148 

0.0884 

0.10 

27 

1 . 6856 

0.0149 

0.0994 

0.11 

24 

1 . 6548 

0.0152 

0.1088 

0.12 

21 

1 . 6330 

0.0149 

0.1183 

0.13 

19 

1.6002 

0.0153 

0.1288 

0.14 

17 

1.5767 

0.0152 

0.1392 

0.15 

15 

1.5528 

0.0148 

0.1514 

0.02 

0.03 

835 

1 . 9565 

0.0200 

0.0300 

0.035 

417 

1.9179 

0.0200 

0.0351 

0.04 

260 

1.8836 

0-0201 

0.0401 

0.045 

182 

1.8523 

0.0201 

0.0452 

0.05 

138 

1.8247 

0.0201 

0.0501 

0.06 

90 

1.7680 

0.0205 

0.0609 

0.07 

65 

1.7364 

0.0199 

0.0697 

0.08 

50 

1.6971 

0.0199 

0.0800 

0.09 

41 

1.6617 

0 . 0200 

0 . 0898 

0.10 

34 

1 . 6290 

0.0200 

0.1001 

0.11 

29 

1 . 5968 

0.0201 

0.1107 

0.12 

26 

1.5722 

0.0203 

0.1191 

0.13 

23 

1.5473 

0.0203 

0.1287 

0.15 

18 

1 . 5037 

0.0196 

0 .  1495 

0.17 

15 

1.4623 

0.0195 

0.1692 

0.20 

12 

1.4041 

0.0196 

0.1986 

0.03 

0.04 

1334 

1 . 8077 

0.0300 

0.0400 

0.045 

643 

1.7766 

0.0300 

0.0451 

0.05 

390 

1.7483 

0.0300 

0.0501 

0.06 

198 

1 . 6979 

0.0301 

0.0601 

0.07 

125 

1.6539 

0.0302 

0.0702 

, 

0.08 

88 

1.6083 

0.0306 

0.0812 

0.09 

67 

1 . 5844 

0.0298 

0.0896 

0.10 

53 

1.5517 

0.0298 

0.0998 

0.11 

44 

1.5236 

0.0298 

0.1092 

0.12 

37 

1 .4955 

0.0298 

0.1194 

0.13 

32 

1.4704 

0.0298 

0.1290 

0.15 

25 

1.4224 

0.0299 

0.1485 

0  20 

15 

1.3149 

0.0297 

0.2013 

0.25 

11 

1.2281 

0.0305 

0.2477 

0.30 

8 

1 . 1553 

0.0293 

0 . 3007 
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Intended 
Pi 

Intended 

N 

k 

True 
Pi 

True 

0.04 

0.06 

524 

1 . 6406 

0.0400 

0.0601 

0.07 

259 

1 . 5964 

0.0401 

0.0701 

0.08 

160 

1 . 5569 

0.0402 

0 . 0802 

0.09 

111 

1.5209 

0.0402 

0 . 0903 

0.10 

83 

1.4920 

0.0399 

0.0997 

0.11 

65 

1.4621 

0.0399 

0 . 1098 

0.12 

53 

1.4348 

0.0398 

0.1196 

0.13 

45 

1.4112 

0.0398 

0.1286 

0.14 

38 

1.3793 

0.0403 

0.1402 

0.15 

33 

1.3565 

0.0402 

0.1500 

0.17 

26 

1.3195 

0.0397 

0.1681 

0.20 

19 

1 , 2585 

0.0396 

0.1988 

0.25 

13 

1 . 1647 

0.0405 

0.2491 

0.30 

9 

1.0967 

0.0378 

0.3027 

0.35 

7 

1.0274 

0.0376 

0 . 3524 

0.40 

6 

0 . 9480 

0.0412 

0 . 3963 

0.05 

0.07 

661 

1 . 5499 

0.0500 

0.0701 

0.08 

320 

1.5102 

0.0501 

0.0801 

0.09 

194 

1.4740 

0.0501 

0.0902 

0.10 

133 

1.4408 

0 . 0502 

0 . 1004 

0.11 

99 

1.4096 

0 . 0504 

0.1104 

0.12 

77 

1.3826 

0 . 0503 

0.1202 

0.13 

62 

1.3567 

0.0502 

0.1302 

0.14 

52 

1.3365 

0.0498 

0.1389 

0.15 

44 

1.3129 

0 . 0498 

0.1488 

0.16 

38 

1 . 2857 

0.0503 

0.1597 

0.17 

33 

1.2636 

0.0503 

0.1700 

0.20 

24 

1 . 2047 

0 . 0506 

0.1985 

0.25 

15 

1.1189 

0 . 0494 

0 . 2499 

0.30 

11 

1 . 0386 

0.0503 

0.2981 

0.35 

8 

0.9633 

0.0493 

0.3537 

0.40 

6 

0  9162 

0.0448 

0.4052 

lying  (10)  and  (11)  or  (12),  but  also  because  of  the  rounding  of  N.  For 
example,  if  pi  =  0.04,  p2  =  0.40,  a  =  0.05,  and  0  =  0.10,  Table  1.2 
shows  N  =  6  and  k  =  0.9480.  The  table  also  shows  that  for  these  values 
of  N  and  k  the  true  values  of  pi  and  p2,  which,  because  N  must  be  an 
integer,  cannot  be  made  precisely  equal  to  the  intended  values,  are 
0.0412  and  0.3963.  Formulas  (10)  and  (11)  yield  for  this  example 
k  =  0.9091  and  N  =  5.4,  which  is  rounded  upward  to  N  =  6;  for  TV  =  6 
and  k  =  0.9091  the  true  pi  and  p2  are  0.0457  and  0.4073.  If  k  is  adjusted 
by  formula  (21),  taking  N  =  6,  p  =  0.04,  and  a  =  0.05,  it  becomes 
0.9247,     For  this  adjusted  value  of  k  the  true  pi  and  p2  are  0.0439  and 
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0.4029.  Thus,  even  for  this  case  where  N  is  only  6,  it  is  questionable 
whether  the  greater  accuracy  of  Table  1.2  or  of  the  adjusted  k,  as  com- 
pared with  formulas  (10)  and  (11)  or  (12),  is  of  substantial  value  for  most 
practical  purposes ;  and  for  larger  values  of  N  the  differences  are  even  less 
important. 

5.  AMOUNT  OF  INSPECTION 

5.1.  Comparison  with  Attribute  Inspection 

If  each  item  were  classified  simply  defective  or  nondefective  and  the 
acceptability  of  the  lot  judged  by  the  number  of  defectives  in  a  single 
sample,  a  sample  of  145  would  be  required  for  an  OC  curve  passing 
through  the  points  defined  by  (13)  and  (14).*  If  the  sampling  were  done 
sequentially  by  attributes,  the  average  number  of  items  inspected  would 
be  20  for  lots  with  no  defectives,  63  for  lots  15  percent  defective,  133  for 
lots  22  percent  defective,  61  for  lots  30  percent  defective,  and  7  for  lots 
100  percent  defective,  f 

TABLE  1.3 
Sample  Sizes  for  Eight  Pairs  of  Single-sampling  Plans  Based  on  Attributes 

and  Variables 
a  =  0.05,         13  =  0.10  for  all  plans 
Na  =  sample  size  for  attributes 
Nt  =  sample  size  for  variables 


Plan 
number 

Pi 

P2 

Nt 

Na 

Na/Nt 

Saving 

(Na   -  Nt)/Na 
percent 

1 

0.001 

0.002 

1,032 

12,476 

12.09 

91.7 

2 

0.001 

0.06 

14 

46 

3  29 

69.6 

3 

0.01 

0.015 

1,231 

4,185 

3.40 

70.6 

4 

0.01 

0.10 

21 

44 

2.10 

52.3 

5 

0.05 

0.07 

661 

1,199 

1.81 

44.9 

6 

0.05 

0.40 

6 

10 

1.67 

40.0 

7 

0.15 

0.20 

323 

493 

1.53 

34.5 

8 

0.15 

0.40 

17 

26 

1.53 

34.6 

For  attributes  the  sample  sizes  have  been  computed  as  explained  in  Chap.  7.  For  variables  they 
have  been  computed  by  the  method  described  in  Sec.  11.5  of  this  chapter,  which  sometimes  gives  values 
slightly  different  from,  and  slightly  more  accurate  than,  those  given  by  formulas  (11)  and  (12). 


*  See  Chap.  7,  Sec.  3,  equation  (21)  and  first  footnote  on  p.  260. 

f  See  Statistical  Research  Group,  Sequential  Analysis,  Sec  2  (second  revision), 
where  the  example  defined  by  equations  (13)  and  (14)  above  is  treated  in  some  detail. 
Sequential  analysis  also  can  be  used  with  variables  (see  Sec.  11.6  below)  and  will 
result  in  a  reduction  in  the  average  amount  of  inspection  similar  to  that  with  attributes 
— roughly  40  to  60  percent  (see  Sequential  Analysis,  Appendix  A,  p.  A. 03). 
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The  savings  in  the  number  of  items  inspected  by  using  variables 
instead  of  attributes  depends  partly  upon  the  steepness  of  the  OC  curve 
and  partly  upon  its  location.  The  steeper  the  OC  curve,  the  greater  the 
saving  from  using  variables.  The  better  the  lot  qualities  involved  (that 
is,  the  smaller  the  percentages  defective),  the  greater  the  saving.  Eight 
comparisons  of  attribute  and  variables  plans  are  shown  in  Table  1.3.  In 
each  case,  a  =  0.05  and  /3  =  0.10.  Na  and  Nt  represent  the  sample  sizes 
for  single-sampling  plans  based  on  attributes  and  variables,  respectively. 

5.2.  Comparison  with  Variables  Inspection  When  the  Standard  Devia- 
tion Is  Known 

If  the  true  standard  deviation  of  the  characteristic  being  inspected 
is  given,  the  proportion  defective  depends  only  upon  the  mean,  being  pi 
or  less  if  the  mean  is  U  —  K\a  or  less,  and  p2  or  more  if  the  mean  is 
U  —  K2a  or  more.  *  If  the  standard  deviation  were  known,  a  sample  of 
73  would  be  required  for  an  OC  curve  passing  through  the  points  defined 
by  (13)  and  (14).  The  reduction  of  22  observations  (23  percent)  from 
the  95  needed  when  the  standard  deviation  is  not  known  thus  represents 

TABLE  1.4 

Sample  Sizes  for  Eight  Pairs  of  Single-sampling  Variables  Plans  Based  on 

Known  and  Unknown  Standard  Deviations 

a  =  0.05,         0  =  0.10  for  all  plans 

Nt  =  sample  size  with  standard  deviation  unknown 

Nm  =  sample  size  with  standard  deviation  known 


Plan 
number 

Vi 

V2 

Nt 

Nm 

Nt/Nm 

Saving 

(Nt  -  Nm)/Nt 
percent 

1 

0.001 

0.002 

1,032 

190 

5.43 

81.6 

2 

0.001 

0  06 

14 

4 

3.50 

71.4 

3 

0.01 

0.015 

1,231 

351 

3.51 

71.5 

4 

0.01 

0.10 

21 

8 

2.62 

61.9 

5 

0.05 

0.07 

661 

300 

2.20 

54.6 

6 

0.05 

0.40 

6 

4 

1.50 

33.3 

7 

0.15 

0.20 

323 

226 

1.43 

30.0 

8 

0.15 

0.40 

17 

14 

1.21 

17.6 

For  the  known  standard  deviation,  the  sample  sizes  are  given  by  1/a,  where  a  is  defined  by  equation 
(137),  Sec.  11.1  of  this  chapter.  For  the  unknown  standard  deviation  they  have  been  computed  by  the 
method  described  in  Sec.  11.5  of  this  chapter,  which  sometimes  gives  values  slightly  different  from,  and 
slightly  more  accurate  than,  those  given  by  formulas  (11)  and  (12). 

the  savings  in  inspection  yielded  by  knowledge  of  the  standard  deviation,  f 
This  saving,  like  that  yielded  by  sufficient  knowledge  of  the  form  of 

*  The  use  of  variables  in  inspection  for  proportion  defective  when  the  standard 
deviation  is  known  is  discussed  by  Romig,  Allowable  Average  in  Sampling  Inspection. 

t  Incidentally,  22  is  the  number  of  observations  required  for  an  OC  curve  satisfy- 
ing (13)  and  (14)  if  the  true  mean  is  known  and  only  the  standard  deviation  need  be 
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distribution  to  permit  the  use  of  variables  instead  of  attributes,  varies 
with  the  OC  curve.  Comparisons  between  Nt,  the  sample  size  for  the 
variables  plans  with  unknown  mean  and  standard  deviation,  and  Nm,  the 
sample  size  with  known  standard  deviation,  are  shown  in  Table  1.4  for 
eight  plans  defined  by  the  same  pairs  of  OC  points  used  in  Table  1.3.  In 
general,  the  percentage  saving,  computed  before  Nt  and  Nm  are  rounded 
to  integers,  is  100/c2 /(k2  +  2),  but  for  small  samples  it  may  differ  from 
this  on  account  of  rounding. 

6.  OPERATING-CHARACTERISTIC  CURVE 

6.1.  Introduction 

The  values  of  k  and  N  may  be  obtained  from  (10)  and  (11)  or  (12) 
so  that  the  OC  curve  will  pass  through  two  specified  points.  They  may 
be  obtained  by  selecting  N  arbitrarily  and  computing  k  from  (21)  so 
that  the  OC  curve  will  pass  through  one  specified  point.  They  may  both 
be  determined  arbitrarily.  Regardless  of  how  they  are  chosen,  their 
effects  must  be  evaluated  in  terms  of  the  OC  curve  to  which  they  give 
rise.  How  will  they  operate  with  lots  of  every  quality  that  might  con- 
ceivably be  offered  ?  If  a  lot  is  good,  what  is  the  risk  that  the  sample 
from  it  will  fail  to  meet  the  acceptance  requirement?  If  a  lot  is  bad, 
what  is  the  risk  that  the  sample  will  satisfy  the  acceptance  criterion? 
The  OC  curve  shows,  for  every  possible  lot  quality  p,  the  probability  Lp 
that  a  lot  of  that  quality  will,  if  submitted,  be  accepted  on  the  basis  of  the 
sampling  plan. 

6.2.  Five-point  OC  Curve 

Three  points  on  the  OC  curve  are  easily  obtained:  (1)  When  there 
are  no  defectives  in  the  lot,  acceptance  is  certain;  (2)  when  the  lot  con- 
tains only  defectives,  rejection  is  certain;  (3)  when  the  proportion  of 
defectives  in  the  lot  is  p0,  such  that  KPo  =  k,  acceptance  and  rejection  are 
equally  likely.     Thus  three  points  on  the  OC  curve  are  given  by 

v  =  o  U  =  1  (32) 

p  =  p0        LP0  =  0.5  (33) 

V  =  1  Li  -  0  (34) 

used  in  testing  percent  defective.  In  general,  if  Nt  is  the  number  of  observations 
required  when  neither  the  mean  nor  the  standard  deviation  is  known,  Nm  the  number 
required  when  the  standard  deviation  is  known,  and  Ns  the  number  required  when 
the  mean  is  known,  the  relation  Nt  =  Nm  +  Ns  will  hold;  in  fact,  Nm  =  2Nt/(k2  +  2) 
and  Ns  =  k2Nt/(k2  +  2).  See  Chap.  8  for  methods  of  determining  the  sample  size 
Ns  when  inspection  is  based  on  the  standard  deviation;  the  sample  size  Nm,  when 
inspection  is  based  on  the  mean,  can  be  determined  from  equation  (148)  in  Sec.  11.1 
of  this  chapter;  it  is  1  /a,  where  a  is  defined  by  equation  (137). 
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where  p0  is  the  probability  that  a  normal  deviate  will  exceed  k.  If  k  and 
N  have  been  determined  by  (10)  and  (11)  or  (12),  the  OC  curve  will  also 
pass  through  the  two  points 


V  =  Pi 


LP1  =  1 


V  =  P2         LP2  = 


(35) 
(36) 


A  sketch  of  the  OC  curve  made  from  these  five  points  is  sufficient  for 
many  purposes.  For  the  illustrative  example  of  Sec.  4.3,  for  example, 
we  have 


V  =  0 

Vi  =  0.15 
p0  =  0.2223 
p2  =  0.30 

V  =  1 


U  =  1 

Lo.15  =  0.99 

£>0.2223  =    0.5 

L0  so  =  0.02 

Ll  =  0 


(37) 
(38) 
(39) 
(40) 
(41) 
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1.4 
defective 


18  20  22  24  26  28 

Percentage  of  defective  items  in  submitted  lots 
Operating-characteristic  (OC)  curve  for  a  variables  inspection  plan  for  percent 
N   =  95,  k   =  0.7645.      [Data  from  Table  1.6,  cohimns  (6),  (7),  (8).] 
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The  central  three  of  these  five  points  are  represented  by  dots  in  Fig.  1.4; 
the  two  extreme  points  are  not  shown  because  of  the  incompleteness  of  the 
p  scale. 

6.3.  Detailed  OC  Curve 

As  many  points  on  the  OC  curve  as  desired  may  be  found  from  the 
following  formula,  which  expresses  the  normal  deviate  corresponding  with 
the  lot  quality  in  terms  of  the  normal  deviate  corresponding  with  any 
given  probability  of  acceptance: 


KP  =  k-KLp^  +  fi%:   ,.. 


N   '   2(N  -  1) 


Formula  (42)  yields  the  normal  deviate  corresponding  with  p.  The  value 
of  p  can  be  found  from  Table  1.5,  which  shows  the  probabilities  of 
exceeding  various  normal  deviates. 

If  p  is  to  be  computed  for  a  number  of  values  of  Lp  in  order  to  plot 
the  entire  OC  curve,  it  is  convenient  while  computing  (42)  to  compute 


Kp,  =  h  +  KLp^±  +  ni,:    ,,  d3) 


N    '   2(N  -  1) 


If  it  is  desired  to  find  the  probability  of  acceptance  for  a  lot  of  given 
quality,  (42)  and  (43)  can  be  rewritten  as 

KLp  -      p.   k~Kl  (44) 


'SIN 


N   '   2(JV  -  1) 
KLpt  =  -KLp  (45) 

6.3.1.  Illustrative  Computation  for  One  Point. — To  illustrate  the  use 
of  formulas  (42)  and  (43),  consider  an  acceptance-inspection  plan  defined 
by  N  =  95  and  k=  0.7645.  For  what  lot  quality  is  the  probability  of 
rejection  1  percent? 

We  have 


N  =  95         k  =  0.7645         #0.99  -  -2.32635  (46) 


Then 


4 


r  +  iw^T)  =0-11677  w 


^•JIm  ,,*.■"  .,--"■- 


2(N  -  1) 


Substituting  into  (42), 

Kp  =  1.03615  (49) 
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TABLE  1.5 
Probabilities  Corresponding  with  Given  Normal  Deviates 


Values  of  e  for  various Kt,  where  — ■==  I 


e         dx 


0.0 
0.1 
0.2 
0.3 
0.4 
0.5 
0.6 
0.7 
0.8 
0.9 
1.0 
1.1 
1.2 
1.3 
1.4 
1.5 
1.6 
1.7 
1.8 
1.9 
2.0 
2.1 
2.2 
2.3 
2.4 
2.5 
2.6 
2.7 
2.8 
2.9 
3.0 


0     1 


0.5000 
0.4602 
0.4207 
0.3821 
0.3446 
0.3085 
0.2743 
0.2420 
0.2119 
0.1841 
0.1587 
0.1357 
0.1151 
0.0968 
0.0808 
0.0668 
0.0548 
0.0446 
0.0359 
0.0287 
0.0228 
0.0179 
0.0139 
0.0107 
0.0082 
0.0062 
0.0047 
0.0035 
0.0026 
0.0019 
0.0013 


4960 
4562 
4168 
3783 
3409 
3050 
2709 
2389 
2090 
1814 
1562 
1335 
1131 
0951 
0793 
0655 
0537 
0436 
0351 
0281 
0222 
0174 
0136 
0104 
0080 
0060 
0045 
0034 
0025 
0018 
0013 


4920 
4522 
4129 
3745 
3372 
3015 
2676 
2358 
2061 
1788 
1539 
1314 
1112 
0934 
0778 
0643 
0526 
0427 
0344 
0274 
0217 
0170 
0132 
0102 
0078 
0059 
0044 
0033 
0024 
0018 
0013 


4880 
4483 
4090 
3707 
3336 
2981 
2643 
2327 
2033 
1762 
1515 
1292 
1093 
0918 
0764 
0630 
0516 
0418 
0336 
0268 
0212 
0166 
0129 
0099 
0075 
0057 
0043 
0032 
0023 
0017 
0012 


4840 
4443 
4052 
3669 
3300 
2946 
2611 
2296 
2005 
1736 
1492 
1271 
1075 
0901 
0749 
0618 
0505 
0409 
0329 
0262 
0207 
0162 
0125 
0096 
0073 
0055 
0041 
0031 
0023 
0016 
0012 


4801 
4404 
4013 
3632 
3264 
2912 
2578 
2266 
1977 
1711 
1469 
1251 
1056 
0885 
0735 
0606 
0495 
0401 
0322 
0256 
0202 
0158 
0122 
0094 
0071 
0054 
0040 
0030 
0022 
0016 
0011 


4761 
4364 
3974 
3594 
3228 
2877 
2546 
2236 
1949 
1685 
1446 
1230 
1038 
0869 
0721 

.0594 
0485 
0392 
0314 
0250 
0197 

.0154 
0119 
0091 
0069 
0052 
0039 
0029 
0021 
0015 
0011 


4721 
4325 
3936 
3557 
3192 
2843 
2514 
2206 
1922 
1660 
1423 
1210 
1020 
0853 
0708 
0582 
0475 
0384 
0307 
0244 
0192 
0150 
0116 
0089 
0068 
0051 
0038 
0028 
0021 
0015 
0011 


4681 
4286 
3897 
3520 
3156 
2810 
2483 
2177 
1894 
1635 
1401 
1190 
1003 
0838 
0694 
0571 
0465 
0375 
0301 
0239 
0188 
0146 
0113 
0087 
0066 
0049 
0037 
0027 
0020 
0014 
0010 


.4641 
.4247 
.3859 
.3483 
.3121 
.2776 
.2451 
.2148 
.1867 
.1611 
.1379 
.1170 
.0985 
.0823 
.0681 
.0559 
.0455 
.0367 
.0294 
.0233 
.0183 
.0143 
.0110 
.0084 
.0064 
.0048 
.0036 
.0026 
.0019 
.0014 
.0010 


The  entries  in  row  (Ke)  are  additional  digits  for  the  values  of  Ke  shown  in  the  first 
column.  Thus,  the  entry  0.0934  in  column  (2)  and  row  (1.3)  corresponds  with 
Kt  =  1.32;  similarly,  the  entry  0.4840  in  column  (4)  and  row  (0.0)  corresponds  with 
Kt  =  0.04. 

A  5  in  italic  indicates  that,  in  rounding,  the  nearest  odd  digit  should  be  used  in 
the  preceding  decimal  place.  When  a  5  is  not  in  italic,  follow  the  usual  rule  of 
rounding  to  the  nearest  even  digit.  Thus  a  final  pair  55  becomes  6,  35  becomes  3, 
85  becomes  9. 

For  a  detailed  table  see  New  York  Mathematical  Tables  Project  Tables  of  Prob- 
ability Functions.  Table  I  gives  values  of  1  —  2e  to  15  decimal  places  for  values  of 
Kt  (denoted  by  x)  from  0  to  1  by  steps  of  0.0001,  and  for  all  values  of  Ke  above  1  by 
steps  of  0.001. 
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Interpolating  linearly  in  Table  1.5,  we  find  p  =  15.01  percent.*  Simi- 
larly, on  substituting  into  (43)  we  find  Kv>  =  0.49285;  sop  =  31.11  per- 
cent for  Lp  —  0.01. 

6.3.2.  Work  Sheet  for  Detailed  OC  Curve.— When  a  number  of  points 
on  the  OC  curve  are  needed,  it  is  convenient  to  arrange  the  computations 
in  a  systematic  way,  as  for  example  that  shown  in  Table  1.6,  which 
contains  the  computations  for  the  OC  curve  of  the  example  of  Sec.  4.3. 

TABLE  1.6 
Work  Sheet  for  Operating-characteristic  Curve 


Basic  quantities:  N  =  95;         k  =  0.7645: 


-tf 


+ 


k* 


2(N  -  1) 


0.11677 


(1) 

(2) 

(3) 

(4) 

(5) 

(6) 

(7) 

(8) 

Lp 

Klp 

S   X  (2) 

k   -  (3) 

k   +  (3) 

P 

V' 

1  -  (1)  =  Zy 

0.005 

2  57583 

0.30078 

0.46372 

1.06528 

0.321 

0.143 

0.995 

0.01 

2.32635 

0.27165 

0.49285 

1.03615 

0.311 

0.150 

0.99 

0.02 

2.05375 

0 . 23982 

0.52468 

1.00432 

0.300 

0.158 

0.98 

0.03 

1.88079 

0.21962 

0 . 54488 

0.98412 

0.293 

0.163 

0.97 

0.04 

1 . 75069 

0 . 20443 

0 . 56007 

0.96893 

0.288 

0.166 

0.96 

0.05 

1 . 64485 

0.19207 

0 . 57243 

0.95657 

0.284 

0.169 

0.95 

0.15 

1.03643 

0.12102 

0 . 64348 

0.88552 

0.260 

0.188 

0.85 

0.25 

0 . 67449 

0.07876 

0.68574 

0  84326 

0.246 

0.200 

0.75 

0.50 

0 

0 

0 . 76450 

0.76450 

0.222 

0.222 

0.50 

First,  enter  at  the  top  of  the  sheet  the  three  basic  quantities  N,  k,  and 
S,  S  being  defined  by  the  left-hand  side  of  equation  (47). 

Second,  enter  in  column  (1)  various  ordinates  Lp  less  than  or  equal  to 
0.5  for  which  the  corresponding  abscissas  p  are  to  be  computed. 

Third,  enter  in  column  (2)  values  KLp  of  the  normal  deviates  corre- 
sponding with  the  values  of  Lp;  these  may  be  obtained  from  Table  1.1. 

Fourth,  enter  in  column  (3)  the  products  of  S  by  the  normal  deviates 
in  column  (2). 

Fifth,  subtract  from  k  the  entries  in  column  (3)  and  enter  the  differ- 
ences in  column  (4). 

Sixth,  add  to  k  the  entries  in  column  (3)  and  enter  the  sums  in  column 
(5). 

Seventh,  enter  in  column  (6)  the  probabilities  corresponding  with  the 
normal  deviates  in  column  (4);  these  may  be  obtained  from  Table  1.5. 

Eighth,  enter  in  column  (7)  the  probabilities  corresponding  with  the 
normal  deviates  in  column  (5);  these  may  be  obtained  from  Table  1.5. 

*  The  value  15.04  percent  stated  in  Sec.  4.6  was  obtained  by  an  exact  method, 
described  in  Sec.  11.5,  which  does  not  involve  the  mathematical  approximations 
involved  in  formulas  (42)  and  (43). 
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Ninth,  subtract  from  1  the  values  of  Lp  shown  in  the  first  column  and 
enter  the  differences  in  column  (8). 

Tenth,  plot  one-half  the  OC  curve,  using  as  abscissas  the  values  of  p 
shown  in  column  (6)  and  as  ordinates  the  corresponding  values  of  Lp 
shown  in  column  (1). 

Eleventh,  plot  the  other  half  of  the  OC  curve,  using  as  abscissas  the 
values  of  pf  shown  in  column  (7)  and  as  ordinates  the  corresponding 
values  of  Lv>  shown  in  column  (8) . 

The  OC  curve  corresponding  with  Table  1.6  is  shoAvn  in  Fig.  1.4. 
The  three  dots  represent  the  central  three  points  of  Sec.  6.2,  formulas  (38) 
to  (40). 

7.  TECHNIQUES  OF  APPLYING  THE  PLAN 

7.1.  Introduction 

Without  calculating  either  the  mean  x  or  the  standard  deviation  s, 
the  computing  can  be  arranged  to  show  whether  the  acceptance  criterion 

x  +  ks  <  U  (50) 

has  been  met.  This  is  done  partly  by  rearranging. formula  (50),  and  partly 
by  carrying  out  in  advance  certain  steps  in  the  computations  that  do  not 
depend  upon  the  data. 

There  is  some  question  as  to  how  much  the  computational  simplifica- 
tions add  to  the  procedure.  They  are  likely  to  be  welcomed  by  those 
who  consider  the  standard  deviation  "too  complicated  for  practical  pur- 
poses," and  who  would  therefore  reject  (50)  as  an  acceptance  criterion  if 
it  were  not  emphasized  that  the  standard  deviation  need  not  actually  be 
computed.  Those  holding  this  view,  however,  often  have  had  little  or  no 
practical  experience  in  computing  or  using  standard  deviations,  and  what 
experience  they  have  had  may  have  been  with  inefficient  computing  tech- 
niques. In  fact,  statisticians  do  not  generally  realize  how  often  standard 
deviations  are  computed  by  the  naive  use  of  formula  (5).  The  mean  is 
calculated,  then  the  N  differences  between  it  and  the  observations  are 
found  and  recorded;  these  are  then  squared  and  added,  the  sum  divided 
by  N  —  1,  and  the  square  root  taken.  Not  only  is  this  process  tedious 
and  liable  to  error,  but  there  is  an  inherent  loss  of  accuracy  from  round- 
ing the  differences.  The  standard  deviation  should,  of  course,  be  com- 
puted from  formula  (6).  With  modern  computing  machines  it  can  be 
calculated  simultaneously  with  the  calculation  of  the  mean  and  with  very 
little  extra  trouble.  With  the  Monroe  Model  AA-1,  for  example,  the  first 
observation  is  set  in  the  keyboard  and  the  X  and  =  keys  pressed  in  rapid 
succession ;  then  the  next  observation  is  set  on  the  keyboard  and  the  same 
two  keys  pressed  again ;  and  so  on.  When  this  has  been  completed  for  all 
N  observations,  which  takes  little  more  time  and  trouble  than  adding  the 
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data,  the  quantities  2x  and  2x2  needed  for  formula  (6)  are  found  in  the 
upper  and  lower  dials,  respectively.  * 

Another  reason  computational  simplifications  are  perhaps  less  impor- 
tant than  they  may  at  first  appear  is  that  the  actual  values  of  the  mean 
and  standard  deviation,  although  not  needed  for  acceptance  purposes,  are 
valuable  for  other  purposes,  as,  for  example,  process  control.  In  fact, 
the  conditions  under  which  inspection  is  based  on  variables  instead  of 
attributes  are  likely  to  be  those  where  the  cost  of  obtaining  each  obser- 
vation is  so  high,  because  of  destruction  of  test  items,  use  of  costly  test 
equipment,  and  so  forth,  that  the  cost  of  analyzing  the  data  is  almost 
negligible  in  comparison. 

7.2.  Computational  Procedure  f 

7.2.1.  Formulas. — For  any  given  plan  of  action,  determined  by  a  fixed 
N  and  k,  the  left-hand  side  of  formula  (50)  depends  only  upon  the  sum 
of  the  observations  2#  and  the  sum  of  the  squares  of  the  observations 
Sx2.     Let 

X  =  2z  (51) 

Y  =  2z2  (52) 

These  two  sums  may  be  computed  easily  and  simultaneously  on  any 
automatic  computing  machine.     If  only  an  adding  machine  is  available, 
Y  can  be  obtained  with  the  aid  of  a  table  of  squares,  for  example,  Barlow's. 
If  k  is  positive,  it  may  be  noted  first  that  if 

X  >  NU  (53) 

the  lot  may  be  rejected  without  regard  to  F.     In  that  case  x  exceeds  U; 
since  5  must  be  positive,  x  +  ks  will  still  further  exceed  U. 
Second,  it  may  be  noted  that  if 

Y  <  ^^  (54) 

the  lot  may  be  accepted  without  regard  to  X.     Equation  (72)  below  is, 
however,  more  efficient  than  (54)  in  permitting  immediate  acceptance. 

*  This  paragraph  indicates  only  the  general  nature  of  the  procedures  for  computing 
the  standard  deviation.  Further  simplifications  are  usually  possible.  Also,  the  use 
of  the  Monroe  Model  AA-1  as  an  illustration  does  not  imply  that  this  make  or  model 
is  superior  to  other  automatic  calculating  machines.  Both  the  Friden  and  Marchant 
companies  make  excellent  machines.  The  Statistical  Research  Group  attempted  to 
evaluate  various  makes  and  models  on  the  basis  of  about  660  machine-months  of 
experience  in  its  offices  from  1942  to  1945.  The  evidence  failed  to  show  any  one 
make  or  model  to  be  best,  although  there  were  interesting  differences.  The  terms 
"upper  dials"  and  " lower  dials"  used  in  this  chapter  apply  to  Marchant  and  Monroe 
calculators  but  should  be  interchanged  for  Friden  calculators. 

f  For  a  discussion  of  the  accuracy  with  which  the  measurements  should  be  made, 
see  Chap.  4. 
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Third,  if  neither  inequality  (53)  nor  (54)  is  satisfied,  the  lot  is  accepted 

if 

Y  <  aX2  -  bX  +  c  (55) 
where 

Nk2  +  N  -  1  ,_ 

«  =  ^ (56) 

b    =   ATTT2 '  (57) 


Nk 
U2(N  -  1) 


(58) 


If  inequality  (55)  is  not  fulfilled,  the  lot  is  rejected.  The  values  of 
a,  b,  and  c  given  by  (56),  (57),  and  (58)  do  not  depend  upon  the  outcome 
of  the  inspection  but  only  upon  the  sampling  plan  defined  by  N  and  k 
and  upon  the  definition  of  a  defective  item  as  given  by  U.  When  a 
sample  has  been  inspected,  the  values  of  X  and  Y  are  obtained  from  the 
observations  and  substituted  into  (55).  Formula  (78)  below  provides  an 
additional  condition,  linear  in  X,  that  may  be  inserted  between  (54)  and 
(55) ,  which  are,  respectively,  independent  of  and  quadratic  in  X ;  formula 
(78),  however,  requires  a  preliminary  estimate  of  the  region  in  which  the 
sample  mean  is  likely  to  lie. 

7.2.2.  Example. — Consider  a  sampling  plan  in  which  N  =  95  and 
k  =  0.7645.  Suppose  that  for  the  characteristic  being  inspected  values 
in  excess  of  U  =  35  are  regarded  as  defective. 

From  these  values  of  N,  k,  and  U  we  have,  from  (56),  (57),  and  (58), 

a  =  0.0283471  (59) 

b  =  118.508  (60) 

c  =  197,019  (61) 

Now  suppose  that  a  sample  shows 

X  =  Sz  =  2,872  (62) 

7  =  Xx2  =  89,175  (63) 

First,  it  is  noted  that,  since 

X  <  3,325  (64) 

inequality  (53)  is  not  fulfilled,  so  immediate  rejection  is  not  called  for. 
Second,  it  is  noted  that,  since 

Y  >  72,675  (65) 

immediate  acceptance  is  not  possible  on  the  basis  of  (54).     Applying  (55), 
it  is  found  that 

Y  <  90,482  (66) 

Therefore,  the  lot  is  acceptable. 
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In  this  example 

x  =  30.232  (67) 

s  =  5.000  (68) 

x  +  ks  =  34.05  (69) 

and,  since  this  is  less  than  35,  acceptance  is  indicated. 

7.3.  Graphical  Procedure 

7.3.1.  Original  Data. — The  computing  involved  in  substituting  X  and 
X2  into  formula  (55)  may  be  avoided  by  the  use  of  a  chart.  Values  of  X 
may  be  plotted  horizontally  and  values  of  Y  vertically,  and  a  curve 
(parabola)  drawn  showing  for  each  value  of  X  the  minimum  value  of  Y, 
YR,  requiring  rejection  as  determined  from  formula  (55)  using  the  equality 
sign.  Then  as  soon  as  X  and  Y  are  determined  from  a  sample,  a  point 
may  be  plotted  on  the  chart.  If  the  point  falls  on  or  below  the  parabola, 
the  lot  is  accepted;  if  the  point  falls  above  the  parabola,  the  lot  is  rejected. 

Figure  1.5  shows  such  a  curve  for  the  example  of  Sec.  7.2.2,  and  the 
point  corresponding  with  the  sample  of  that  example  is  , shown.  *  Also 
shown  in  the  figure  is  a  lower  boundary  for  the  acceptance  region.  This 
lower  boundary  arises  from  the  fact  that,  for  any  given  sum  of  the  obser- 
vations X,  it  is  algebraically  .impossible  for  the  sum  of  the  squares  of  the 
observations  Y  to  be  less  than  a  certain  minimum;  this  minimum  is  the 
value  Y  takes  when  all  the  observations  are  equal,  namely, 

Ye  =  J  (70) 

If  a  sample  point  should  fall  on  the  lower  line,  it  would  indicate  that  the 
standard  deviation  was  zero.  A  sample  point  cannot  fall  below  the 
lower  line  unless  there  is  an  error  in  the  calculations. 

For  practical  purposes  only  a  part  of  the  range  of  X  shown  in  Fig.  1.5 
would  be  needed,  for  example,  that  enclosed  in  the  rectangle.  Confining 
the  chart  to  this  region,  as  in  Fig.  1.6,  permits  the  use  of  a  larger  scale 
and  thus  improves  the  accuracy  of  plotting. 

7.3.2.  Adjusted  Data. — The  graphical  procedure  of  Sec.  7.3.1  is 
impractical  when  the  observations  are  large.  In  such  cases  the  range  of 
variation  in  Y  as  X  varies  is  large  relative  to  the  width  of  the  acceptance 
band  for  a  fixed  X;  when  the  Y  scale  is  contracted  sufficiently  to  go  on  a 
sheet  of  reasonable  size,  the  upper  and  lower  boundaries  of  the  acceptance 
region  are  pulled  so  close  together  that  the  chart  becomes  useless.  A 
remedy  for  this  is  to  replace  the  sum  of  the  squares  of  the  observations  Y 

*  Formula  (54)  gives  the  minimum  value  of  formula  (55).  Actually,  it  gives  a 
value  slightly  below  the  minimum,  because  the  minimum,  which  occurs  when 
X  =  UN(N  -  l)/(Nk*  +N  -  l),isX£/2(X  -  l)/(Nk2  +  N  -  1).  If  Y is  as  small 
as  (54),  it  is  impossible  for  X  to  be  too  large;  this  is  illustrated  in  Fig.  1.5. 
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by  the  sum  of  the  squares  of  adjusted  observations  Y'  where 

T  =  X(x  -  A)2  (71) 

=  Y  -  2AX  +  NA2 

and  A  is  an  adjustment  constant.  The  adjustment  constant  is,  in  effect, 
subtracted  from  each  observation  before  squaring;  actually  the  sum  of 
squares  of  the  adjusted  observations,  Y',  is  obtained  directly  from  X  and 
Y  by  formula  (71)  without  adjusting  each  observation. 

In  order  to  make  the  chart  most  useful,  the  adjustment  constant  A 
should,  if  possible,  be  chosen  near  the  mean  of  the  characteristic  being 
inspected.  Usually  there  will  be  some  evidence  as  to  the  general  neigh- 
borhood in  which  the  mean  lies.  If  there  is  no  evidence  at  all,  it  will  often 
be  suitable  to  choose  A  somewhere  in  the  interval  between  U/(k2  +  1) 
and  U. 

To  compute  F',  X  and  Y  are  obtained  simultaneously  as  described 
in  Sec.  7.1,  and  X  is  recorded.  Then  X  is  copied  to  the  keyboard  and 
multiplied  negatively  by  2A,  which  may  be  computed  in  advance.  Next 
NA2,  which  also  should  be  computed  in  advance,  is  added.  These  two 
operations  adjust  the  sum  of  squares,  and  Y'  may  be  copied  from  the 
lower  dials.  An  alternative  computing  procedure,  which  may  be  useful 
if  the  same  inspection  plan,  or  at  least  the  same  adjustment  constant,  is 
to  be  used  repeatedly,  is  to  prepare  a  table  showing  (x  —  A)2  for  the 
various  values  of  x  that  might  be  observed.  From  this  table  Y'  can  be 
found  by  simple  addition. 

The  values  of  X  and  Y'  determine  whether  the  acceptance  criterion 
has  been  met.  If  X  >  NU,  immediate  rejection  may  be  made,  accord- 
ing to  formula  (53).     Instead  of  (54)  for  immediate  acceptance,  we  have 

T <  (^-w-i)  (72) 

If  this  is  fulfilled  the  lot  may  be  accepted  immediately.  Notice  that  Yr 
may  satisfy  (72)  even  when  Y  fails  to  satisfy  (54).  Otherwise,  a  chart 
similar  to  Fig.  1.6  may  be  drawn  in  which  the  upper  boundary  of  the 
acceptance  region  is  given  by 

Y'R  =  aX2  -  (6  +  2A)X  +  (c  +  NA2)  (73) 

where  a,  b,  and  c  are  defined  by  (56),  (57),  and  (58),  and  the  lower  bound- 
ary is  given  by 

Y'E  =  j^-2AX  +  NA2  (74) 

Figure  1.7  shows  such  a  chart  for  the  example  of  Sec.  7.2.2,  the. 
adjustment  constant  being  taken  as  A  =  30.     Thus,  the  equations  of  the 
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Fig.  1.5. — Graphical  procedure  for  determining  from  2a;  and  Sx2  whether  x  +  ks  <  U; 
N  =  95,  k  =  0.7645,  U  =  35. 


Sec.  7.3.2 


VARIABLES 


39 


Y=Exz 


REJ 

ECT 

4 

v/ 

f/7 

Xs 

A     2 

ample  point 
,872;  89,175 

yr      AC 

:ept    / 

ERF 

*0R 

2,600  2,700  2,800  2,900  3,000  3,100  3,200  3,300  3,400 

X  =  Ix 
Note:  See  note  to  Fig.  1.5. 
Fig.  1.6.— Enlarged  segment  of  computational  chart  in  terms  of  original  data  (enlargement 

of  area  indicated  in  Fig  1.5). 


40 


TECHNIQUES  OF  STATISTICAL  ANALYSIS 


Chap.  1 


Y'».I(x-30) 


9,000 

8,000 

7,000 
6,000 
5,000 
4,000 

\  n 

\% 

REJ 

ECT 

\  <* 

o 

3,000 

ACC 

EPT 

2,000 
1,000 

Sample  p 
2,872;   2, 

jint — 

355          Ti 

0 

^ 

£-60** 

8^2^ 

ERROR 

i 

2,600  2,700  2,800  2,900  3,000  3,100  3,200  3,300  3,400 

X  =  Ix 
Note:  See  note  to  Fig.  1.5. 
Fig.  1.7. — Graphical  procedure  for  determining  from  2x  and  2(x  —  A)2  whether 
x  +  ks  <  U;  N  =  95,  k  =  0.7645,  U  =  35,  A  =  30. 


Sec.  8.1  VARIABLES  41 

upper  and  lower  boundaries  of  the  acceptance  region  are 

Y^  =  0.0283471X2  -  178.508X  +  282.519  (75) 

Y'E  =  0.0105263X2  -  60X  +  85.500  (76) 

Since  2 A  =  60  and  NA2  =  85,500,  the  value  of  Y'  in  the  sample  is,  by 
formula  (71),  2,355.  This  does  not  satisfy  inequality  (72),  which  would 
require  that  Y'  be  1,483  or  less;  but  when  the  point  X  =  2,872,  F  =  2,355 
is  plotted  in  Fig.  1.7,  it  is  seen  to  be  within  the  acceptance  region. 

It  is  stated  in  Sec.  7.2.1  that  (72)  is  more  effective  than  (54)  in 
permitting  immediate  acceptance.  In  the  present  example,  (72)  requires 
that  Y'  <  1,483,  which  (since  X  =  2,872)  is  equivalent  to  requiring  that 
Y  <  88,303,  whereas  (54)  requires  that  Y  <  72,675.  Thus  values  of  Y 
between  72,675  and  88,303  will  lead  to  immediate  acceptance  by  (72) 
but  not  by  (54).  From  the  point  of  view  of  permitting  acceptance  on 
the  basis  of  Y',  rather  than  the  point  of  view  of  making  the  chart  usable, 
very  nearly  the  most  efficient  form  of  (72)  is  obtained  by  setting 

A  =  *  ±  *>'  -  V  (77) 

where  /z'  is  the  best  estimate  of  x  available  before  the  sample  is  inspected. 
In  the  present  example,  if  n'  is  taken  as  30,  formula  (77)  shows  A  about 
21.5.  Taking  A  =  21,  formula  (72)  becomes  Y'  <  11,628,  and  since 
the  sample  value  of  Y'  is  10,446,  immediate  acceptance  is  possible.  The 
value  of  A  given  by  (77)  may  be  made  the  basis  of  a  condition  for  accept- 
ance in  terms  of  Y;  intermediate  between  (54)  and  (55),  which  are, 
respectively,  constant  and  quadratic  in  X,  the  linear  condition  for  accept- 
ance may  be  inserted 

Y  <c'  -  b'X  (78) 

where 

V  =  b  -  2aNp'  (79) 

c'  =  c-  aNV2  (80) 

and  a,  b,  and  c  are  denned  by  (56),  (57),  and  (58).  Taking  \i'  =  30  in 
the  illustrative  example,  (78)  becomes  Y  <  90,468,  which  leads  to  accept- 
ance. If  the  value  of  \i!  happens  to  coincide  with  x,  (78)  becomes 
equivalent  to  (55). 

8.  REDUCED  AND  TIGHTENED  INSPECTION 

8.1.  Purpose  of  Reducing  or  Tightening  Inspection 

It  is  emphasized  in  Sees.  3.4  and  6.1  that  the  OC  curve  shows  the 
conditional  probability  that  if  a  submitted  lot  has  a  given  quality  p 
it  will  be  accepted.  It  does  not  show  the  probability  that  a  lot  of 
quality  p  will  be  accepted.     This  latter  probability  cannot  be  controlled 
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by  the  sampling  plan  alone,  for  it  depends  upon  the  probability  that  a  lot 
of  quality  p  will  be  submitted. 

In  judging  the  value  of  the  protection  afforded  by  a  sampling  plan, 
information  about  the  quality  of  submitted  material  is  important.  The 
fact  that  a  plan  rejects  90  percent  of  submitted  lots  having  a  certain 
unacceptable  quality  may  be  of  little  importance  if  a  great  many  of  the 
submitted  lots  are  of  that  quality;  in  that  case  many  of  the  lots  passed 
by  the  plan  will  be  unacceptable,  even  though  most  of  the  submitted  lots 
are  rejected.  The  fact  that  the  plan  rejects  90  percent  of  submitted 
lots  of  an  unacceptable  quality  p  is  also  unimportant  if  submitted  lots 
rarely  have  that  quality,  for  then  very  little  such  material  will  be  accepted, 
even  though  most  submitted  lots  are  passed.  In  the  first  case,  when  bad 
material  is  submitted  very  often,  the  protection  given  by  the  sampling 
plan  may  be  inadequate  and  require  tightening;  but  in  the  second  case, 
when  bad  material  is  rarely  submitted,  the  protection  may  be  superfluous 
and  reduction  may  be  permissible.  * 

8.2.  Criteria  for  Reducing  or  Tightening  Inspection 

8.2.1.  Process  Average  Percent  Defective. — The  principal  criterion 
for  determining  whether  reduced  inspection  is  permissible  or  tightened 
inspection  necessary  is  the  process  average  percent  defective,  usually 
called  simply  the  process  average  or  PA,  which  is  an  estimate  of  the 
percent  defective  computed  from  a  number  of  lots. 

This  estimate  is  best  stated  as  a  confidence  interval,  that  is,  an  interval 
within  which  it  may  be  asserted  with,  say,  95  percent  confidence  that  the 
true  process  average  lies.  If  the  estimate  were  stated  as  a  single  figure,  it 
would  invariably  differ  from  the  true  process  average  because  of  sampling 
fluctuations;  and  the  question  would  therefore  arise  as  to  whether  the 
difference  between  the  estimated  process  average  and  the  acceptable 
quality  level  could  reasonably  be  ascribed  to  sampling  fluctuations,  or 
whether  on  the  other  hand  it  indicated  a  real  discrepancy  between  the  true 
process  average  and  the  acceptable  quality  level.  A  confidence  interval 
answers  such  questions.  It  is  constructed  from  the  sample  data  in  such  a 
way  that  95  percent,  or  any  other  desired  proportion  y,  of  the  samples  will 
lead  to  confidence  intervals  that  overlap  the  true  process  average. 

For  the  purpose  of  computing  the  process  average,  all  data  in  the 
samples  from  a  number  h  of  consecutive  lots  are  thrown  together  into  one 
large  sample.  The  mean  x  and  the  standard  deviation  s  of  this  combined 
sample  are  computed  from  formulas  (4)  and  (6).     Alternatively,  they 

*  For  a  general  discussion  of  the  principles  and  methods  of  tightening  and  reducing 
inspection,  see  Statistical  Research  Group,  Sampling  Inspection.  The  discussion 
there  relates  to  inspection  by  attributes,  but,  insofar  as  it  runs  in  terms  of  the  OC 
curve  and  the  amount  of  inspection,  it  is  also  applicable  to  variables. 
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may  be  computed  from  the  means  Xi,  the  standard  deviations  Si,  and  the 
sample  sizes  Ni  of  the  individual  samples  by  the  following  formulas: 

_       Nixi  +  N2x2  +   •  •  •   +  Nhxh       ZNjXj 

x~        N!  +  N2  +  •  •  •  +  Nh  N  {*L) 

(Ni  -  l)sf  +  (N2  -  1)4  +  •  -  •  +  (Nh  -  1)4 

+  ^!(xi  -  *)2  +  7V2(x2  -  x)2  +   •  •  •   +#*(:**  -  x)2 


2(iVi  -  l)sf  +  ZNiXJ 


Ni  +  iV2  +   •  •  •   +  N, 
(ZNiXi)2 


(82) 


A" 


N  -  1 
where 

N  =  2Ni  (83) 

From  x,  s,  and  U  we  compute  A;0,  the  observed  difference  between  U 
and  the  sample  mean  in  units  of  the  sample  standard  deviation 

K  =  IL-l  (84) 

Then,  given  a  confidence  coefficient  y,  a  confidence  interval  which  in  a 
fraction  7  of  samples  will  overlap  the  true  fraction  defective  may  be 
obtained  from  formulas  (42)  and  (43)  by  substituting  k0  for  k  and  (1  —  y)/2 
for  Lp.  The  p  and  p'  given  by  (42)  and  (43)  are  the  upper  and  lower  con- 
fidence limits  py  and  py,  respectively,  for  the  fraction  defective  produced 
by  the  process.  The  larger  the  confidence  coefficient,  the  broader  the 
interval;  and  the  smaller  the  confidence  coefficient,  the  narrower  the 
interval.  In  fact,  if  7  is  1,  the  interval  runs  from  0  to  100  percent; 
and  if  7  is  0,  the  interval  includes  only  the  point  p  =  p,  where  p  is  the 
probability  that  a  normal  deviate  will  exceed  k0. 

If  the  acceptable  quality  level  lies  above  the  confidence  interval, 
reduced  inspection  is  permissible.  *  If  it  lies  below  the  confidence  interval, 
tightened  inspection  is  necessary.  If  it  lies  within  the  confidence 
interval,  normal  inspection  may  be  continued. 

In  order  to  avoid  too  frequent  shifts  in  the  degree  of  inspection,  it  is 
advisable  that  the  criteria  for  resuming  normal  inspection  when  tightened 
or  reduced  inspection  is  in  effect  should  differ  from  the  criteria  for  remain- 
ing on  normal  inspection  when  it  is  already  in  effect.  One  way  to  achieve 
this  is  to  use  a  larger  confidence  coefficient  when  deciding  whether  to 
continue  normal  inspection  than  when  deciding  whether  to  resume  nor- 

*  For  reduced  inspection,  it  is  customary  to  impose  a  further  requirement  to 
restrict  the  variability  of  the  process  average  from  lot  to  lot.  This  usually  takes  the 
form  of  requiring  that  each  of  the  last  twenty  (or  some  other  number)  lots  have  been 
acceptable. 
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mal  inspection.  Reduced  inspection,  for  example,  will  be  installed  as 
soon  as  the  process  average  falls  to  a  certain  level,  but  once  it  has  been 
installed  normal  inspection  will  not  be  resumed  unless  this  level  is  exceeded 
by  a  certain  margin.  It  is  therefore  possible,  once  either  reduced  or 
tightened  inspection  has  been  installed,  to  continue  it  under  conditions 
that  would  not  justify  its  introduction  were  normal  inspection  in  effect. 

8.2.2.  Illustrative  Computation  of  Process  Average. — Suppose  that 
the  combined  sample  from  several  inspection  lots  contains  95  observa- 
tions, that  their  sum  is  2,872,  that  the  sum  of  their  squares  is  89,175, 
and  that  values  above  35  represent  defects.  Suppose  a  confidence  coeffi- 
cient yc  of  0.99  is  used  for  judging  whether  to  continue  normal  inspection 
and  a  confidence  coefficient  yr  of  0.95  for  judging  whether  to  resume  nor- 
mal inspection.     Thus 


N  =  95         Xx  =  2,872 
U  =  35          yc  =  0.99 

Xx2  =  89,175 
yr  =  0.95 

(85) 
(86) 

mulas  (4)  and  (6) 

x  =  30.2316         s 

=  5.000 

(87) 

7         35  -  30.2316 

k0  — 

=  0.9537 

(88) 

so 


Entering  Table  1.5  with  K€  =  0.9537,  we  find  (by  linear  interpolation) 
17.01  percent  as  the  point  estimate  p,  corresponding  with  a  confidence 
coefficient  of  zero,  of  the  process  average  percent  defective. 

To  find  the  99  percent  confidence  limits,  we  substitute  k  =  0.9537, 
Lp  =  (I  -  0.99)/2  =  0.005  into  (42)  and  (43).  The  details  of  a  similar 
computation  are  shown  in  Sec.  6.3.1.  The  99  percent  confidence  limits 
are 

p0  99  =  10.15  percent         and         p0.99  =  26.29  percent         (89) 
Similarly,  taking  Lp=  0.025,  the  95  percent  confidence  limits  are 

£0  95  =  11.57  percent         and         p0.95  =  23.86  percent  (90) 

If  normal  inspection  is  in  effect,  it  will  be  continued  if  the  AQL  for  the 
product  is  between  10.15  and  26.29  percent  defective;  reduced  inspection 
will  be  installed  if  the  AQL  exceeds  26.29  percent  and  tightened  inspection 
will  be  installed  if  the  AQL  is  below  10.15  percent.  If  tightened  or 
reduced  inspection  is  in  effect,  normal  inspection  will  be  resumed  if 
the  AQL  is  between  11.57  and  23.86  percent.  If  the  difference  between 
the  intervals  for  continuation  and  for  resumption  is  too  small,  that  is,  if 
too  many  shifts  occur,  it  can  be  widened  by  increasing  the  difference 
between  the  two  confidence  coefficients  used. 
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For  routine  operations  in  which  the  same  AQL  and  the  same  value  of 
U  are  to  be  used  repeatedly,  it  is  desirable  to  prepare  a  table  showing  for 
various  sample  sizes  the  values  of  k0  that  result  in  continuing  normal 
inspection  if  it  is  in  use  and  those  that  result  in  resuming  normal  inspec- 
tion if  tightened  or  reduced  inspection  is  in  use.  In  the  present  illustra- 
tion, for  example,  if  the  AQL  is  15  percent,  a  value  of  k0  computed  from 
95  observations  leads  under  normal  inspection  to  continuing  normal 
inspection  if  it  lies  between  &0.99  =  0.7380  and  /c0.99  ==  1.4107,  or  leads 
under  reduced  or  tightened  inspection  to  resuming  normal  inspection  if  it 
lies  between  fto.95  =  0.8048  and  fto.95  =  1.3113.  These  values  are  deter- 
mined from  formula  (21),  using  both  the  plus  and  the  minus  signs  to  obtain 
upper  and  lower  limits  and  taking  pi  =  AQL,  a  =  (1  —  y)/2.  An  exam- 
ple of  the  computation  of  (21)  is  given  in  Sec.  4.5.2. 

8.3.  Methods  of  Reducing  or  Tightening  Inspection 

Reduced  inspection  may  be  achieved  by  reducing  N,  perhaps  as 
much  as  40  or  50  percent,  and  adjusting  k  so  that  the  producer's  risk 
will  not  be  increased.  This  leaves  the  OC  curve  approximately  unchanged 
for  qualities  equal  to  or  better  than  the  AQL  but  increases  the  risk  that 
lots  of  worse  quality  will,  if  submitted,  be  accepted. 

Tightened  inspection  may  be  achieved  by  increasing  k  but  leaving  N 
fixed.  This  moves  the  OC  curve  to  the  left,  thus  reducing  the  consumer's 
risk  but  increasing  the  producer's  risk. 

9.   CHOICE   OF  QUALITY  STANDARDS  AND  EVALUATION   OF  PROPOSED 

SAMPLING  PLANS 

9.1.  General  Discussion 

The  definition  of  a  defective  item,  the  choice  of  an  AQL  and  an 
LTPD,*  and  the  selection  of  producer's  and  consumer's  risks  usually 
require  a  compromise  between  what  is  technically  or  economically  desir- 
able in  the  use  for  which  the  product  is  intended  and  what  is  technically  or 
economically  feasible  in  its  production.  Although  no  fixed  rules  can  be 
established  for  selecting  U,  ph  p2,  a,  and  ft,  a  few  guides  are  worth  noting. 

*  LTPD,  or  lot  tolerance  percent  defective,  is  defined  sometimes  in  terms  of  the 
use  for  which  the  product  is  intended,  as  the  worst  quality  for  which  there  should  be 
any  appreciable  probability  of  a  lot's  being  accepted,  and  sometimes,  in  terms  of  the 
sampling  plan,  as  that  quality  for  which  the  probability  of  acceptance  has  a  specified 
value  (customarily  10  percent).  Similarly,  the  AQL,  or  acceptable  quality  level,  is 
defined  sometimes  as  the  worst  quality  that  can  be  consistently  accepted  or  as  the 
best  quality  for  which  there  should  be  any  appreciable  probability  of  a  lot's  being 
rejected,  and  sometimes,  as  that  quality  for  which  a  certain  sampling  plan  gives  a 
specified  probability  of  rejection  (customarily  5  percent).  We  follow  the  definitions 
in  terms  of  sampling  plans,  using  the  customary  risks  of  5  and  10  percent  for  the 
AQL  and  LTPD,  respectively. 
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Some  of  these  five  quantities  can  be  selected  arbitrarily  or  by  conven- 
tion without  restricting  the  range  of  available  plans.  For  example, 
it  is  more  or  less  conventional  to  take  the  producer's  risk  a  as  5  percent 
and  the  consumer's  risk  /3  as  10  percent.  Then  the  AQL,  or  ph  may  be 
defined  as  that  quality  for  which  the  probability  of  acceptance  should  be 
95  percent  and  the  LTPD,  or  p2,  as  that  quality  for  which  the  probability 
of  acceptance  should  be  10  percent.  Sometimes  a  and  (3  are  set  tenta- 
tively and  values  of  U,  ph  and  p2  selected;  then  the  values  of  a  and  /3  may 
be  reexamined  in  the  light  of  the  number  of  lots  of  qualities  pi  and  p2 
likely  to  be  submitted  and  the  costs  of  rejecting  those  of  quality  p\  or 
accepting  those  of  quality  p2. 

To  select  a  sampling  plan,  two  independent  analyses  should  be  made 
and  their  results  merged.  One  study  should  consider  the  uses  to  which 
the  product  is  to  be  put.  What  is  required  of  each  item,  how  rigid  are 
these  requirements,  and  what  is  the  effect  of  a  certain  percentage  of  the 
items  failing  to  meet  the  requirements  ?  The  other  study  should  consider 
the  production  process.  Within  what  limits  do  most  of  the  items  fall, 
how  stable  are  these  limits,  what  would  be  the  costs  of  improving  them, 
and  what  is  the  cost  of  having  a  lot  rejected?  In  designing  a  sampling 
plan  on  the  basis  of  these  two  studies,  it  is  necessary  to  take  into  account 
the  cost  of  inspection.  What  is  the  cost  of  drawing  a  sample,  what  is 
the  cost  of  inspecting  the  items,  what  is  the  cost  of  analyzing  the  data; 
and  how  do  all  these  vary  with  the  size  of  the  sample? 

9.2.  Analysis  of  Production  Possibilities 

It  should  be  assumed  that  the  producer  will  consider  the  OC  curve  as 
in  effect  the  specification  of  what  will  be  accepted,  and  that  he  will  adjust 
his  operations  to  an  OC  curve  as  best  suits  his  own  circumstances.  For 
example,  if  the  producer  finds  that  a  rejected  lot  entails  considerable 
costs,  he  will  consider  that  the  given  OC  curve  actually  forces  him  to 
produce  at  a  quality  much  better  than  the  AQL,  how  much  better  depend- 
ing upon  how  sharply  the  cost  of  production  rises  and  the  probability  of  a 
rejection  declines  as  quality  improves.  If  improvements  in  quality  are 
also  costly,  a  given  OC  curve  may  mean  that  the  consumer  will  receive — 
and,  of  course,  ultimately  pay  for — better  quality  than  he  needs.  Under 
such  circumstances  the  consumer  will  find  it  advantageous  to  increase  the 
amount  of  inspection,  for  which  also  he  ultimately  pays;  this  will  steepen 
the  OC  curve,  that  is,  increase  the  rate  at  which  the  probability  of  a 
rejection  declines  as  quality  improves,  and  therefore  reduce  the  degree  to 
which  the  producer  is  forced  to  improve  the  quality  and  raise  the  cost.  * 

There  is  obviously  a  relation  between   U  on  the  one  hand  and  pi 

*  Some  discussion  of  this  point  is  given  in  J.  Przyborowski  and  H.  Wilenski, 
"Testing  Clover  Seed,"  especially  Sec.  111(6),  pp.  281-285. 
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and  p2  on  the  other.  If  U  is  set  very  high,  only  very  bad  items  are 
considered  defective;  only  a  small  percentage  of  such  items  can  be 
accepted,  so  pi  should  be  small.  If  U  is  set  very  low,  items  are  considered 
defective  when  they  deviate  very  little  from  normal;  the  effect  of  such 
items  is  less  serious,  so  a  larger  percentage  of  them  can  be  accepted. 

If  we  regard  U  as  fixed,  we  may,  by  analyzing  data  on  the  quality 
characteristic  that  is  to  be  inspected,  estimate  what  proportion  defective 
is  being  produced.  Section  8.2  gives  a  method  for  finding  confidence 
intervals  py  to  py  which,  in  a  proportion  y  of  the  samples  from  which 
such  confidence  intervals  are  computed,  overlap  the  true  fraction  defec- 
tive; or,  if  the  confidence  coefficient  is  reduced  to  zero,  we  may  obtain  a 
single-valued  estimate  p.  The  height  of  the  OC  curve  over  the  interval 
py  to  py  then  indicates  approximately  how  much  of  the  current  production 
would  be  passed  by  a  proposed  sampling  plan. 

If,  on  the  other  hand,  we  regard  p\  as  fixed,  we  may  estimate  the 
value  of  U  necessary  to  give  a  confidence  y  that  the  fraction  defective  will 
not  exceed  pi.     This  value  is 

Uy  =  x  +  kyS  (91) 

where  ky  is  given  by  formula  (21),  using  the  plus  sign  and  taking  px  as 
the  fixed  fraction  defective  and  a  as  1  —  7.  Thus,  in  the  example  of 
Sec.  8.2.2,  where  px  =  0.15,  x  =  30.232,  and  s  =  5.000,  L.95  =  1.2632 
and  A;0.99  =  1.3695;  so  we  may  be  95  percent  confident  that  under  the 
conditions  prevailing  when  the  sample  was  collected  at  least  85  percent 
of  the  items  lie  below  Uo.w  =  36.54,  or  99  percent  confident  that  at  least 
85  percent  lie  below  U0.99  =  37.08.      U  is  called  a  "tolerance  limit."* 

9.3.  Analysis  of  Consumption  Requirements 

In  analyzing  the  consumption,  or  design,  requirements  of  a  product, 
account  should  be  taken  of  the  variability  of  other  products  or  methods 
with  which  it  will  be  associated  in  use.  A  discussion  of  this  is  beyond  the 
scope  of  this  chapter,  f  but  one  especially  simple  example  may  be  men- 
tioned. 

*  See  Chap.  2  for  a  discussion  of  tolerance  limits.  Chapter  2,  however,  is  con- 
cerned with  two-sided  tolerance  limits,  that  is,  values  of  k  such  that  at  least  a  given 
proportion  P  of  the  population  is  included  between  the  limits  x  —  ks  and  x  +  ks. 

An  excellent  discussion  of  tolerance  limits,  which  in  fact  is  the  original  source 
from  which  subsequent  theoretical  work  has  developed,  is  given  in  Shewhart,  Statistical 
Method,  Chap.  II,  pp.  50-79,  How  Establish  Limits  of  Variability? 

|  See  Shewhart,  Economic  Control  of  Quality,  Part  V,  pp.  247-272,  Statistical 
Basis  for  Specification  for  Standard  Quality,  especially  Chap.  XVII,  pp.  249-261, 
Design  Limits  on  Variability,  in  which  Sec.  5,  pp.  252-256,  considers  Tolerances  for 
Quality  of  Finished  Product  in  Terms  of  Tolerances  of  Piece-parts,  and  Sec.  6,  pp. 
256-259,  discusses  The  General  Problem  of  Setting  Tolerances  on  Controlled  Product. 
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Suppose  the  items  from  a  lot  are  to  be  assembled  at  random  in  sets  of 
n,  and  a  set  of  n  will  be  defective  if  the  sum  of  the  lengths  of  the  n  pieces 
exceeds  nil' ',  that  is,  if  the  mean  length  in  the  set  exceeds  U'.  If  we 
consider  lot  quality  measured  by  the  percent  of  means  of  n  that  exceed  U' 
and  select  two  points  on  the  desired  OC  curve,  values  of  N  and  k,  say  N' 
and  k' ',  can  be  obtained  from  (10)  and  (11)  or  (12).  If  now  inspection  is 
carried  out  on  the  basis  of  individual  items,  rather  than  sets  of  n,  the 
lot  can  be  accepted  if  in  a  sample  of  nNr  items 

x  +  -^  s  <  U'  (92) 

n 


where  x  and  s  are  the  mean  and  standard  deviation  computed  from  the  nN' 
items. 

If  in  this  example  a  piece  were  considered  defective  if  it  exceeded  one- 
nth  the  allowable  sum  for  n  pieces,  that  is,  if  it  exceeded  U',  the  sampling 
plan  would  require  a  sample  of  N'  items  and  the  acceptance  requirement 
would  be 

x  +  k's  <  V  (93) 

This  would  necessitate  production  at  a  higher  quality  than  would  (92). 
For  example,  if  n  =  4  and  k  =  0.7645,  the  indifference  quality,  that  is, 
the  quality  having  an  equal  chance  of  acceptance  or  rejection,  is  35.12 
percent  defective  for  (92)  and  22.23  percent  for  (93).  If  n  —  9  and 
k  =  2.0077  (the  k  given  in  Table  1.2  for  pi  =  0.015,  p2  =  0.03),  the  indif- 
ference quality  is  25.17  percent  under  (92)  and  2.23  percent  under  (93). 

As  a  digression,  it  may  be  observed  that  averages  in  sets  of  n  may  be 
appropriate  measures  of  quality  when  inspection  is  undertaken  to  deter- 
mine Avhether  full  measure  is  delivered.  Suppose  tomato  juice  is  packed 
in  one-gallon  cans*  that  the  final  purchasers  buy  one  at  a  time.  Then 
any  purchaser  whose  can  contains  less  than  one  gallon  is  cheated,  whether 
or  not  some  other  purchaser  receives  an  equivalent  or  even  a  greater 
excess.  Even  though  the  purchaser  would  not  be  cheated  in  the  long  run 
if  his  purchase  were  repeated  frequently,  an  appreciable  shortage  in  any 
one  can  may  inconvenience  or  annoy  him — in  fact,  may  prevent  his 
repeating  the  purchase.  In  such  circumstances,  the  appropriate  meas- 
ure of  lot  quality  is  the  percent  of  cans  containing  less  than  a  stated 
amount,  say  one  gallon,  or  perhaps  0.99  gallon  if  a  1  percent  deviation  is 
permissible.  If,  however,  the  tomato  juice  is  to  be  sold  to  restaurants  in 
cases  of  12  cans,  the  appropriate  measure  of  lot  quality  is  the  percent  of 
groups  of  12  that  average  less  than  one  gallon  per  can.  If  the  tomato 
juice  is  to  be  sold  to  Army  camps  in  carloads,  the  percent  of  carloads 

*  The  example  is  taken  from  Peach,  Industrial  Statistics,  p.  86.  Peach,  however, 
treats  the  example  differently. 
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averaging  less  than  one  gallon  per  can  is  the  appropriate  measure  of  lot 
quality;  in  such  a  case,  however,  one  carload  might  well  be  an  inspection 
lot,  so  that  the  lot  mean  would  become  the  appropriate  quality  measure. 

9.4.  Normality  Assumption 

If  within  lots  the  distribution  of  items  is  not  approximately  normal,  the 
sampling  plan  usually  controls  the  percent  defective  as  consistently  as  if 
the  distribution  were  normal,  but  not  at  the  level  intended. 

What  the  plan  actually  controls  is  the  quantity  defined  by  equation 
(1),  namely, 

K  =  V^  (94) 

(T 

Thus  a  lot  is  rejected  if  the  sample  value  of  K,  say  k0,  is  too  small  and 
accepted  if  it  is  sufficiently  large.  The  OC  curve  might  be  plotted  with 
the  horizontal  axis  representing  the  true  value  of  K  in  submitted  lots; 
such  an  OC  curve  would  be  essentially  correct  regardless  of  the  normality 
of  the  distribution  within  lots.  The  assumption  of  normality  becomes 
important  when  translating  values  of  K  into  values  of  p,  the  fraction 
defective. 

If  the  distribution  is  not  normal  but  its  form  is  stable  from  lot  to  lot 
and  is  such  that  the  percent  defective  is  determined  by  K,  then  it  is  possi- 
ble to  substitute  for  Table  1.5  another  table  more  appropriate  for  translat- 
ing K  into  percent  defective.  In  any  event,  it  may  be  assumed  that  the 
fraction  defective  corresponding  with  any  K  does  not  exceed  (l/K)2;  and 
if  the  distribution  is  unimodal  and  has  its  mode  near  its  mean  it  may  be 
assumed  that  for  any  K  the  fraction  defective  does  not  exceed  (2/3K)2.* 

The  effects  of  non-normality  are  excellently  stated  by  Jennett  and 
Welch  :f 

The  above  theory  is  developed  on  the  assumption  that  the  parent  population 
sampled  is  Gaussian.  Even  if  it  is  not,  the  reasoning  relating  the  distribution  of 
k0  with  K  will  not  be  invalidated  unless  the  samples  are  very  small,  because  non- 
normality  will  not  have  much  influence  on  x  and  s.  The  chief  error  lies  in  con- 
necting K  with  p  by  using  the  normal  tables.  This  can  be  obviated  if  some  other 
constant  distributional  form  can  be  assumed  for  which  the  relation  between  the 
standardized  deviate  K  and  p  is  known.  If  we  could  say  that  the  distribution 
would  maintain  the  same  shape  .  .  .  /although  with  possible  changes  in  the 
mean  and  standard  deviation,  the  above  theory  would  need  little  modification. 

*  These  quantities  are  based  on  the  Tchebycheff  and  Camp-Meidell  inequalities, 
respectively  (see  Shewhart,  Economic  Control  of  Quality,  pp.  176-177) ;  in  the  present 
context  they  can  be  halved  if  symmetry  can  be  assumed,  and  reduced  further  if 
positive  skewness  is  assumed. 

f  "The  Control  of  Proportion  Defective."  The  passage  quoted  is  from  Sec.  3, 
p.  88,  Effect  of  Non-normality.  The  notation  in  the  passage  as  quoted  has  been 
altered  to  agree  with  that  used  in  this  chapter. 
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It  would  be  misleading,  however,  to  lay  too  much  emphasis  on  the  necessity 
for  making  the  connection  between  K  and  p  correctly.  Usually  it  is  not  possible 
to  say  with  absolute  certainty  that  a  particular  value  of  p  is  satisfactory  and 
another,  differing  little  from  it,  is  unsatisfactory.  Indeed,  often  it  may  not  be 
clear  that  p  has  any  intrinsic  importance  at  all,  but  it  may  still  be  useful  for 
purposes  of  control.  For  the  same  reason  K  may  be  regarded  as  a  useful  feature 
of  the  distribution  to  aim  at  controlling,  without  inquiring  too  closely  into  the 
exact  meaning  of  K.  Non-normality  is  therefore  not  such  a  serious  matter  as  it 
may  appear.  However,  no  final  word  can  be  said  about  this  without  reference 
to  the  context  in  which  the  method  is  to  be  used. 

10.  SIMULTANEOUS  USE  OF  SEVERAL  SAMPLING  PLANS 
10.1.  Inspection  of  Several  Quality  Characteristics 

Often  the  quality  of  an  item  depends  upon  several  variables.  There 
may  be  measurements  of  physical  dimensions,  for  example,  those  which 
determine  whether  the  item  will  fit  properly  in  a  final  assembly,  or 
whether  full  measure  has  been  delivered;  measurements  of  the  properties 
of  the  item,  for  example,  hardness,  chemical  composition,  tensile  strength, 
or  modulus  of  elasticity;  measurements  of  performance,  for  example, 
duration  of  useful  life  or  rate  of  output;  and  so  forth.  The  lot  quality 
then  is  measured  by  n  fractions  defective,  say  plf  p2,  .  .  .  ,  pn. 

Suppose  that  n  criteria  of  the  form 

x  +  ks  <  U  (95) 

are  applied  to  a  single  sample  of  N.  The  probability  that  the  lot  will 
pass  all  n  tests  cannot  be  read  from  the  OC  curve  for  any  one  test.  For 
a  given  set  of  values  pi,  p2,  .  .  .  ,  pn,  the  probability  that  a  lot  will  be 
accepted  cannot  be  more  than  the  smallest  among  the  n  values  of  Lp; 
it  will  be  this  large  if,  and  only  if,  the  n  tests  are  perfectly  correlated,  so 
that  a  lot  passing  any  one  test  is  certain  to  pass  all.  At  the  other  extreme, 
the  probability  of  acceptance  may  be  zero;  it  will  be  this  small  if  a  lot  that 
passes  one  test  is  certain  to  fail  some  other,  or  if  Lp  is  zero  for  any  of  the  n 
values  of  p.  * 

*  It  might  be  thought  that  these  two  extreme  cases  would  never  be  encountered  in 
practice.  Specification  writing  and  acceptance  test  designing  being  what  they  are, 
this  is  not  the  case.  The  author  cherishes  the  memory  of  one  particular  specification 
for  a  cylindrical  tube.  It  specified  length,  inside  diameter,  outside  diameter,  con- 
centricity, weight,  density,  absence  of  voids,  and  all  the  details  of  chemical  composi- 
tion, physical  properties,  and  manufacturing  processes  that  determine  density.  Such 
a  specification  must  be,  at  least  in  some  degree,  either  redundant  or  inconsistent;  this 
particular  one  perhaps  had  been  redundant  originally,  but  through  an  encrustation  of 
changes  made  in  one  detail  at  a  time  it  had  become  inconsistent.  It  would  be  naive 
to  assume,  however,  that  because  no  material  could  meet  the  acceptance  criteria  none 
was  accepted.  On  the  contrary,  all  of  it  was  accepted.  This  was  achieved  partly  by 
the  buyer's  reasoning  that  the  product  was  needed,  that  the  acceptance  tests  failed 
to  pass  usable  material,  and  that  therefore  material  failing  to  pass  the  tests  was  usable; 
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Between  these  two  extremes  of  perfect  positive  correlation  and 
perfect  negative  correlation  among  the  tests  is  the  case  in  which  the  tests 
are  independent.  In  this  case,  the  probability  of  accepting  a  lot  of  quality 
Vh  Vh  •  •  >  Pn  is  given  by  the  product  of  the  individual  probabilities  of 
acceptance: 

*JP\,Pll      •      •      •      ,    Pn     =    -Lpi    '  ^P2    ••'■'•  Lpn  W^j 

Since  each  LPi  is  1  or  less,  this  product  is  less  than  the  least  LVi  among  the 
set  of  n.  If  n  =  3  and  each  LPi  =  0.95,  the  product  is  (0.95)3  or  0.857; 
if  each  LPi  =  0.99,  the  product  is  (0.99) 3  or  0.970.  For  n  =  10  the  cor- 
responding products  are  0.599  and  0.904.  (When  LPi  is  near  1  and  n  is 
not  too  large,  the  probability  of  rejecting  is  about  n  times  as  large  for  n 
independent  tests  as  for  one.) 

Sometimes  the  situation  is  that  for  good  quality  the  various  tests  are 
independent  but  that  for  poor  quality  the  tests  are  positively  correlated. 
Thus,  if  for  each  of  five  characteristics  a  lot  has  fractions  defective  that 
give  it  a  95  percent  chance  of  acceptance  on  the  individual  tests,  its 
chance  of  passing  all  five  tests  is  about  (0.95)5  or  0.774.  But  if  its  chance 
of  acceptance  on  each  individual  test  is  0. 10,  its  chance  of  acceptance  on 
all  five  may  be  nearly  0.10.  In  such  situations,  not  only  is  the  OC  curve 
for  the  battery  of  tests  lower  than  that  for  any  one  test,  but  it  gives  less 
sharp  discrimination  between  good  and  bad  lots.  On  the  other  hand,  if 
the  tests  are  independent  or  positively  correlated  for  good  quality  but 
negatively  correlated  for  bad  quality,  the  discrimination  is  sharpened. 

Another  aspect  of  the  simultaneous  use  of  several  criteria  like  (95)  is 
that  this  treats  the  various  tests  on  an  attribute  basis,  so  to  speak.  If  a 
lot  just  barely  fails  to  pass  one  test  but  appears  to  be  of  exceptionally  high 
quality  on  all  other  tests,  it  is  rejected;  whereas  a  second  lot  that  just 
barely  passes  each  test  is  accepted.  It  would  be  preferable  to  combine 
all  the  measurements  on  one  item  into  a  single  score  for  that  item  and 
judge  the  lot  by  the  scores  of  the  sample  items.  * 

All  these  uncertainties  about  the  simultaneous  operation  of  several 
criteria  make  the  selection,  operation,  and  interpretation  of  the  plan  much 
less  clear-cut  and  straightforward  than  when  only  one  criterion  is  con- 

and  partly  by  the  seller's  disclaiming  responsibility  for  the  quality  of  the  product, 
reasoning  that  he  could  take  responsibility  either  for  the  quality  of  the  product  or  for 
following  detailed  instructions  for  making  it,  but  not  for  the  consistency  of  those 
instructions  with  the  desired  quality. 

*  Chapter  3  deals  with  this  problem.  The  methods  presented  there  are  new  and 
have  not  yet  been  used  extensively  or  developed  to  the  point  where  they  are  com- 
parable with  those  of  this  chapter  in  simplicity  of  selecting  plans  and  constructing 
OC  curves,  or  in  ease  of  routine  operation  and  interpretation.  They  will  nevertheless 
well  repay  careful  study,  especially  when  complicated  and  expensive  products  or 
experiments  are  concerned. 
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cerned.  They  underscore  the  necessity,  if  variables  are  to  be  used,  of  a 
careful  preliminary  analysis  of  a  rather  large  body  of  data  on  the  product 
to  which  the  plans  are  to  be  applied. 

10.2.  Upper  and  Lower  Bounds  on  a  Single  Quality  Characteristic 

10.2.1.  General  Considerations. — The  OC  curve  for  two  simultaneous 
criteria  of  the  form  (96)  can  be  determined  mathematically  in  the  case 
where  the  two  criteria  simply  impose  upper  and  lower  limits  on  a  single 
characteristic;  that  is,  the  case  where  an  item  is  defective  if  it  is  either 
above  some  upper  limit  U  or  below  some  minimum  M.  Some  mathe- 
matical investigations  of  this  have  been  made,*  but  no  completely  satis- 
factory solution  has  yet  been  obtained.  The  indications  are,  however, 
that  for  practical  purposes  it  Avill  ordinarily  suffice  to  use  two  criteria  of 
the  form  (96)  determined  independently  for  the  upper  and  lower  limits. 
Two  points  about  this  procedure,  however,  should  be  recognized. 

The  first  point  is  that  the  combinations  of  x  and  s  that  lead  to  accept- 
ance are  not  quite  the  same  when  two  one-sided  criteria  are  used  as  they 
would  be  if  a  single  two-sided  test  were  constructed.  The  combinations 
that  lead  to  acceptance  with  two  one-sided  tests  (having  the  same  values 
of  N  and  k)  are  included  in  an  area  shaped  like  the  triangle  of  Fig.  1.8, 
whereas  a  single  two-sided  test  would  result  in  rounding  off  the  top  corner 
of  this  triangle,  somewhat  as  in  the  curve  of  Fig.  1.8.  If  most  of  the 
production  should  fall  in  the  small  area  not  common  to  both  regions,  this 
difference  would  be  of  importance.  This  would  be  a  situation  in  which 
the  mean  behaved  excellently,  consistently  lying  midway  between  M 
and  U,  but  the  standard  deviation  was  continually  near  the  maximum 
value  permitting  acceptance  with  the  best  value  of  the  mean.  If  such  a 
situation  were  to  prevail  consistently,  acceptance  inspection  would 
probably  be  based  on  the  standard  deviation  alone. 

The  second  point  to  be  kept  in  mind  in  using  two  one-sided  tests 
for  a  given  characteristic  is  the  effect  on  the  OC  curve.  The  true  OC 
curve  for  a  two-sided  test  cannot  be  expressed  in  terms  of  the  lot  fraction 
defective  alone;  it  depends  upon  the  lot  mean  and  standard  deviation.  A 
sample  that  fails  to  meet  one  criterion  because  of  a  large  standard  devia- 
tion is  likely  to  fail  the  other  criterion  also;  but  a  sample  that  fails  one 
criterion  because  of  an  extreme  mean  is  likely  to  pass  the  other  criterion. 

Thus,  although  the  use  of  two  one-sided  criteria  to  control  both  upward 
and  downward  variation  often  works  out  satisfactorily  in  practice,  it 
should  be  recognized  as  an  improvisation  for  a  problem  that  is  in  essential 
respects  different  from  the  one-sided  problem  and  is  less  amenable  to 
simple  methods  and  interpretations. 

*  The  mathematical  work  was  done  in  the  Statistical  Research  Group  by  A.  Wald 
and  J.  Wolfowitz  but  has  not  been  published. 
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10.2.2.  Use  of  a  Pair  of  One-sided  Criteria. — Suppose  two  tests  are 
constructed  so  that  with  one  the  probability  is  (3  that  a  lot  with  fraction  p 
(p  <  0.5)  above  U  will  be  accepted  and  with  the  other  the  probability  is  (3 
that  a  lot  with  fraction  p  below  M  will  be  accepted.  If  the  standard 
deviation  of  the  measurements  is  small,  the  probability  is  practically  /? 
that  a  lot  with  fraction  p  outside  the  limits  M  and  U  will  be  accepted ;  but 
if  the  standard  deviation  is  large,  the  probability  is  practically  (3  that  a 
lot  with  fraction  2p  outside  the  limits  M  and  U  will  be  accepted.  If  p/2  is 
substituted  for  p,  the  test  will  give  the  intended  probability  of  acceptance 
for  high  values  of  the  standard  deviation  but  not  for  low  values  of  the 
standard  deviation. 

If  the  standard  deviation  is  expected  always  to  be  low — it  need  not  be 
so  low  when  the  relevant  values  of  p  are  small  as  when  they  are  large — 
or  if  lots  with  high  standard  deviations  are  sufficiently  infrequent  as  to  have 
only  negligible  effects  on  the  operation  of  the  tests,  a  pair  of  one-sided 
tests  may  be  used  successfully.  The  two  acceptance  criteria,  both  of 
which  must  be  satisfied  before  a  lot  is  accepted,  are  then 

x  +  ks  <  U        and         x  -  ks  >  M  (97) 

It  may  be  noted  that  if  x  >  (M  +  U)/2,  the  second  inequality  can  be 
ignored,  for  it  will  surely  be  met  if  the  first  is ;  similarly,  if  x  <  (M  +  U)  /2 , 
the  first  inequality  can  be  ignored. 

As  in  the  case  of  a  single  criterion,  we  may  note  that  if 

X  >  NU        or         X  <  NM  (98) 

the  lot  may  be  rejected  without  regard  to  Y,  where  X  and  Y  are  the  sum  of 
the  observations  and  the  sum  of  the  squares  of  the  observations,  respec- 
tively, as  defined  by  formulas  (51)  and  (52). 
Second,  we  may  note  that  if 

the  lot  may  be  accepted. 

Third,  if  neither  (98)  nor  (99)  is  satisfied,  the  lot  is  accepted  if 

X  >  N  M^U        and         Y  <  aX>  -  bvX  +  Cu  (100) 


or 


X  <  N  —  't  U        and         Y  <  aX*  -  bMX  +  cM  (101) 


where 


Nk2  +  N  -  1 

N2k2 


(102) 
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(103) 


,      _  2U(N  -  1)         ,          2M(N  -  1) 
bu N¥     '       bM N¥ — 


U2(N  -  1)  M\N  -  1) 

Cv  =  p Cm  =  ^ '-  (104) 

If  neither  (100)  nor  (101)  is  met,  the  lot  is  rejected. 

For  adjusted  observations,  as  defined  by  (71),  the  lot  may  be  accepted 
Avithout  regard  to  X  if 

Y,  <  (u  +  aw  - 1)     when     a>m  +  u     (l05) 


or 


v,    .  (A  -  M)\N  -  1)  ,  . ■    .  M  +  U 

Y><'- W^T-  when         A<^2— 

In  place  of  (100)  and  (101)  we  have 


(106) 


X  >  N  M  ±  -        and        Y'  <  aX>  -  (bv +  2A)X  +  (Cu  +  NA*)     (107) 

X  <  N  M  j  U       and       Y'<aX*-(bM  +  2A)X  +  (cM  +  NA*)     (108) 

The  most  efficient  form  of  (105)  and  (106),  either  for  graphical  pur- 
poses or  for  permitting  immediate  acceptance  on  the  basis  of  Y'  alone,  is 
obtained  by  setting 

A=^JL  (109) 

In  this  case  inequalities  (105)  and  (106)  are  identical. 

For  either  adjusted  or  unadjusted  observations  graphs  may  be  con- 
structed as  described  in  Sec.  7.3.  The  maximum  value  of  Y  for  accept- 
ance for  each  value  of  X  will  be  obtained  from  formula  (100)  or  (107) 
for  values  of  X  greater  than  N(M  +  U)/2  and  from  formula  (101)  or  (108) 
for  values  of  X  less  than  N(M  +  U)/2.  In  the  case  of  adjusted  observa- 
tions, if  A  =  (M  +  U)/2  the  figures  will  be  symmetrical  about  the  line 
X  —  NA.  The  upper  boundaries  of  the  error  region  may  be  found  by 
formula  (70)  or  (74)  of  Sec.  7.3. 

10.2.3.  Two-sided  Criterion. — A  two-sided  criterion  is  most  easily 
applied  if  expressed  in  terms  of  a  chart  similar  to  Fig.  1.8,  in  which  values 
of  x  are  represented  horizontally  and  values  of  s  vertically.  For  each 
sample,  a  point  is  plotted  corresponding  to  the  observed  x  and  5.  If  this 
point  lies  within  the  region  bounded  by  the  acceptance  curve  and  the 
horizontal  axis,  the  lot  is  accepted;  otherwise,  it  is  rejected. 

The  first  step  in  finding  the  boundary  of  the  acceptance  region  is  to 
compute  k  and  N  from  formulas  (10)  and  (11).  The  values  of  pi  and  p2 
now  represent  fractions  outside  the  interval  M  to  U,  instead  of  fractions 
above  U. 
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The  second  step  is  to  find  the  indifference  quality  p0,  which  is  simply 
the  probability  that  a  normal  deviate  will  exceed  k.  It  may  be  deter- 
mined from  Table  1.5. 

The  third  step  is  to  divide  p0  arbitrarily  into  two  parts,  p'0  and 

Po  (Vo  <  Po)  sucn  that 

Po  +  P"  =  Po  (HO) 

Each  pair  p'0,  p"  leads  to  two  points  on  the  acceptance  curve,  so  enough 
different  pairs  should  be  used  to  permit  plotting  the  curve. 
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Fig.  1.8. — Acceptance  region  for  a  two-sided  test  denned  by  pi  =  0.15,  %>i  =  0.30, 
a.  =  0.01,  j8  =  0.02,  U  =  35,  and  M  =  25;  also  corresponding  region  for  a  pair  of  one-sided 
tests.     [Data  from  Table  1.7,  columns  (9),  (10)  and  (11).] 

The  fourth  step  is  to  find  Kp'0  and  Kp'0',  normal  deviates  exceeded  with 
probabilities  p'0  and  p",  respectively.  These  may  be  obtained  from  Table 
1.1. 

The  fifth  step  is  to  substitute  each  pair  of  values  KP'Q,  Kp'0'  into  the 
pair  of  equations 


UK, 


MKX 


Kp'  +  Kj 


s 


U  -  M 

Kp'  +  Kp 


cm) 
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or  into  the  equivalent  pair 

_  =  U  +  M       U  -  M    Kp'0-  KP'Q'  _  U  -M  2 

2         !         2         Kp'0  +  Kp'0'  2         Kp'0  +  Kp'o' 

Each  pair  of  values  Kp'0,  Kp'0'  when  substituted  into  (111)  or  (112)  leads  to 
a  combination  x,s  lying  on  the  boundary  of  the  acceptance  region;  another 
point  on  the  boundary  is  given  by  the  combination  x',s  where 

x'  =  U  +  M  -  x  (113) 

Table  1.7  shows  the  computations  for  the  two-sided  acceptance  region 
defined  by  pi  =  0.15,  p2  =  0.30,  a  =  0.01,  and  (3  =  0.02,  where  defects 
are  represented  by  values  below  25  or  above  35.  The  values  N  =  95  and 
k  =  0.7645  are  taken  from  Sec.  4.3.  The  resulting  region  is  plotted 
in  Fig.  1.8.  The  figure  also  shows  the  acceptance  region  given  by  using 
the  pair  of  one-sided  criteria 

x  -  0.7645s  >  25         x  +  0.7645s  <  35  (114) 

It  will  be  observed  that  the  acceptance  region  for  the  two-sided  test  lies 
entirely  within  that  for  the  pair  of  one-sided  tests.  As  x  approaches  U  or 
M,  the  two  boundaries  nearly  coincide.  The  difference  between  the 
regions  is  fairly  sizable  in  this  case,  which  involves  rather  large  fractions 
defective,  but  for  smaller  fractions  defective  the  two  regions  more  nearly 
coincide.  For  example,  keeping  M  =  25  and  U  =  35,  but  taking 
pi  =  0.001,  p2  =  0.0015,  a  =  0.05,  (3  =  0.10  (for  which  Table  1.2  shows 
k  =  3.0214)  we  find  that  for  x  =  30  the  maximum  acceptable  value  of  s 
is  given  by  the  pair  of  one-sided  tests  as  1.655  and  by  the  two-sided  test  as 
1.550;  and  the  difference  is  less  for  any  other  value  of  x.  The  effect 
of  the  difference  between  a  pair  of  one-sided  tests  and  a  two-sided  test 
should  be  judged  by  the  frequency  of  lots  leading  to  sample  points  that 
would  signify  acceptance  by  one  test  and  rejection  by  the  other.  One  way 
in  which  the  two-sided  test  presented  here  is  useful  is  in  judging  whether 
two  one-sided  tests  can  be  used  satisfactorily. 

If  a  two-sided  test  is  used,  criteria  for  reducing  or  tightening  inspec- 
tion may  also  be  expressed  graphically.  Upper  and  lower  limits  of  k0 
should  be  computed  as  described  in  the  last  paragraph  of  Sec.  8.2.2  and 
used  as  values  of  k  from  which  to  compute  two  boundary  curves  accord- 
ing to  formulas  (111)  or  (112).  If  the  value  of  k0  found  in  computing  the 
PA  lies  between  the  two  curves,  normal  inspection  is  continued ;  if  it  falls 
below  the  lower  curve,  tightened  inspection  is  installed;  and  if  it  falls 
above  the  upper  curve  reduced  inspection  is  installed. 

11.  TECHNICAL  NOTES 
11.1.  Basic  Formulas 

11.1.1.  Assumptions. — Sampling  plans  for  controlling  percent  defec- 
tive on  the  basis  of  variables  involve  drawing  a  sample  of  N  at  random 


58  TECHNIQUES  OF  STATISTICAL  ANALYSIS  Chap.  1 

(115) 

(116) 

(117) 

x  represents  measurements  of  the  variable  being  inspected,  and  &  is  a 
constant. 

If  x  is  normally  distributed,  x  will  be  normally  distributed  with  mean 
H  and  standard  deviation  <r/\/~N,  where  fx  and  a  are  the  lot  mean  and 
standard  deviation  of  x.  Also,  unless  N  is  extremely  small,  say,  less  than 
5,  s  will  also  be  nearly  normally  distributed,  its  mean  and  standard 
deviation  being  aa  and  ba/y/2(N  —  1),  respectively,  where 


from  a 

lot  and 

computing 

x  +  ks 

where 

x  = 

2x 
N 

s2  = 

iV2x2  - 
N(N  - 

(2x)2 
-  1) 

Hi) 


N 
a  =  - — -r^— ¥^  (118) 

b  =  V2(iV  -  1)(1  -  a)2  (119) 

The  factors  a  and  b  are  slightly  less  than  1  —  when  N  =  5,  for  example, 
a  =  0.940  and  b  =  0.965  —  and  they  increase  toward  1  as  N  increases.* 
They  are  therefore  treated  as  1  throughout  this  chapter;  that  is,  the  stand- 
ard deviation  is  regarded  as  normally  distributed  with  mean  a  and  stand- 
ard deviation  a/\/2(N  —  1). 

Since  a  linear  combination  of  independently  distributed  variables 
tends  to  be  normally  distributed  even  when  the  individual  variables 
depart  from  normality,  x  +  ks  will  be  more  nearly  normally  distributed 
than  s;  in  fact,  x  +  ks  will  tend  toward  normality  even  if  the  distribution 
of  x  departs  from  normality.  The  mean  of  x  +  ks  is  fi  +  ka  and  the 
standard  deviation  is 


°yji 


Y  +  2(/=T)  (120) 


*' A  table  of  a  and  b  is  given  in  Jennett  and  Welch,  "Control  of  Proportion  Defec- 
tive," p.  85.  This  table  gives  a  and  b  to  3  decimal  places  for  values  of  N  from  5  to  15 
by  steps  of  1,  from  15  to  30  by  steps  of  5,  and  for  40,  50,  75,  and  100.  A  table  of 
ci  =  a  \/(N  —  1)/N  to  5  decimal  places  for  values  of  N  from  3  to  25  by  steps  of  1  and 
from  25  to  100  by  steps  of  5  is  given  by  Shewhart,  Economic  Control  of  Quality,  p.  185. 
Values  of  C2  =  a  \/(N  —  1)/N  are  given  to  4  decimal  places  for  values  of  N  from  2  to 
25  by  steps  of  1  in  American  Society  for  Testing  Materials  Manual,  p.  50  of  Supple- 
ment B;  the  same  table  is  given  in  American  Standards  Association,  Control  Chart 
Method,  p.  39. 
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11.1.2.  Direct  Derivation. — The  probability  that  in  a  sample  x  +  ks 
will  be  less  than  U  then  depends  on  the  difference  between  U  and  i±  -\-  ka, 
measured  in  units  of  the  standard  deviation  of  x  +  ks;  that  is,  the  proba- 
bility that  x  +  ks  will  be  less  than  U  is  the  probability  that  a  unit  normal 
deviate  will  be  less  than 

U  -  0.  +  fcr)  (m 


\JV 


+   *2 


N   l  2{N  -  1) 

The  probability  will  be  e  that  the  test  will  result  in  acceptance  if 
U  -  0*  +  fer) 


*>& 


JL2 

+ 


-Ke  =  Xi_£  (122) 


N  ■     2(N  -  1) 
where  K€  is  defined  by 

^/*Y5*-'  (123) 

We  may  write  (122) 

(U  —  fi.)/<T  is  a  unit  normal  deviate  that  will  be  exceeded  with  some 
probability  p,  where  p  is  the  proportion  defective  in  the  lot,  so  we  may 
write 

Kp  =  ^^  (125) 

a 

Substituting  this  into  (124)  and  rearranging,  we  find 


k  =  Kp  +  KeyJ±  +  0//v;    ,, 


k- 

N    '    2(iV  -  1) 


If  both  sides  of  (126)  are  decreased  by  Kp  and  then  squared,  a  rear- 
rangement of  terms  gives 

(l-wh])ki-2K>k  +  (K*-§)  =  0      (127) 

Denoting  the  coefficient  of  k2  by  a  and  the  term  independent  of  k  by  b,  k 
may  be  expressed  in  terms  of  N,  Kp,  and  Ke  as 

k  =  K»  +  VK»  ~  ab  (128) 

a 

If  Kp  +  k  is  subtracted  from  both  sides  of  (126)  and  the  equation 
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multiplied  by  —  1,  we  have  an  equation  expressing  the  fraction  defective, 
in  terms  of  a  normal  deviate,  as  a  function  of  N,  k,  and  e;  that  is,  the 
abscissa  of  the  OC  curve  is  expressed  as  a  function  of  the  ordinate  of  the 
OC  curve  and  of  the  parameters  defining  the  sampling  plan. 

Suppose  we  desire  to  choose  N  and  k  so  that  the  probability  will  be 
1  —  a  of  accepting  a  lot  of  quality  pi  and  /3  of  accepting  a  lot  of  quality 
p2,  where  a  +  j3<  1  and  p2  >  pi.  Substituting  p1  and  1  —  a  into  (126) 
gives  one  equation  in  N  and  k,  and  substituting  p2  and  (3  gives  another 


k  =  K1  +  K^a^J±  +  _; 


N    *   2(N  -  1) 


*  =  X2  +  X,^H-   ,(  /      r 


Jc2 

AT    '    2{N  -  1) 


where  i£i  and  K2  have  been  written  instead  of  KPl  and  i£P2.  After  sub- 
tracting Ki  from  (129)  and  K2  from  (130)  and  dividing  (129)  by  (130), 
we  have 

k  -  Kl       K^a 


k  -  K2        Kp 

Solving  this  for  k  and  replacing  K\-a  by  —  Ka  gives 


(131) 


fc  "  T^IT  (132) 

AT  may  then  be  determined  by  substituting  k  into  (129)  or  (130)  and 
solving.  The  solution  will  be  considerably  simpler  and  there  will  be 
little  if  any  loss  in  accuracy  if  in  the  denominator  of  the  second  term  under 
the  radical  AT  —  1  is  replaced  by  N.  With  this  approximation  (129) 
becomes 


*    !    ^J1  +|  {[XU 


VN\  2 


Subtracting  Ki  from  both  sides,  multiplying  by  \/N/(Ki  —  k),  squaring, 
and  replacing  KX-a  by  —  Ka  yields 


"  ■  IT 

Similarly,  (130)  leads  to 


N  =  1W ST  (134) 


Kl 


M) 

fc  -  #2)2 
By  adding  (134)  and  (135)  and  replacing  the  A:  of  the  denominators  by 


N  =  -W^F  (135) 
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(132),  N  may  be  expressed  as  in  (12);  and  by  substituting  for  k  in  the 
numerators  as  well  as  the  denominators,  N  may  be  expressed  directly  in 
terms  of  Kh  K2}  Ka,  and  Kp,  as  in  (11). 

If  instead  of  replacing  N  —  1  by  N  in  (129)  and  (130)  we  replace  N 
by  N  —  1,  the  right-hand  sides  of  (134)  and  (135)  are  not  affected;  but 
the  left-hand  sides  become  N  —  1.  This  approximation  therefore  leads 
to  a  value  of  N  one  unit  larger  than  that  given  by  (134)  and  (135).  Since 
AT  must  be  an  integer,  the  effects  of  rounding  are  of  the  same  order  of 
magnitude  as  those  of  replacing  N  —  1  by  N.  It  is  because  of  this 
approximation  that  values  of  N  given  by  (134)  and  (135)  that  are  not 
integers  should  be  rounded  up  instead  of  to  the  nearest  integer.  If  the 
approximation  is  not  made,  we  find 


where 


N  =  fe  +  V^-16a  (136) 

b  =  k2  +  2(<a  +  1)  (138^ 

11.1.3.  Derivation  from  General  Formulas  for  Sample  Size  and 
Critical  Level. — As  an  alternative  derivation  of  formulas  (132)  and 
(134),  we  may  start  from  the  fact  that  x  +  ks  is  normally  distributed 
about  a  certain  mean  in  good  lots  and  about  another  mean  in  bad  lots  and 
employ  the  ordinary  procedures  for  testing  the  mean  of  a  normally 
distributed  variable.  Let  z  represent  a  normally  distributed  variable 
whose  mean  is  Z  and  whose  standard  deviation  is  a  function  of  the  sample 
size  of  the  form 

cr(N)  =  a-F(N)  (139) 

where  F(N)  is  any  function  of  N  that  is  independent  of  a.  Suppose  a 
test  is  required  such  that  the  probability  is  a  of  rejecting  a  lot  in  which 
the  mean  is  Z\  and  (3  of  accepting  a  lot  in  which  the  mean  is  Z2,  where 
Z2  >  Z\.  The  test  is  made  by  finding  2ina  sample  of  N  and  accepting 
the  lot  if  z  <  A  or  rejecting  it  if  z  >  A,  where  A  is  a  constant.  Then 
if  a  lot  has  a  mean  of  Zi,  the  probability  Lx  of  acceptance  is 


L  l  fA 

1        <7(iV)V/27rJ_0 


d-ZO' 


e     2°2W  dz  (140) 


A-Zi 


where 


t/- "2rf/ 


1  =  W  (141) 


62  TECHNIQUES  OF  STATISTICAL  ANALYSIS  Chap.  1 

If  the  lot  mean  is  Z2  the  probability  L2  of  acceptance  is 

U  =     /ATN      _   /       e    WW  dz  (142) 

<r(N)  V2*  J-* 

A- 

1        f"7i 


™  e"*dt 


where  t  is  defined  by  (141)  but  in  terms  of  Z2  instead  of  Z\.     The  appro- 
priate test  is  then  defined  by  values  of  A  and  N  such  that 

Li  =  1  -.  a  (143) 

L2  =  (3  (144) 

that    is, 

w  =  x-  (145) 

w  =  -*  (146) 

On  dividing  (145)  by  (146)  and  solving  for  A,  we  find 

.        KaZ2  +  KpZi 


Ka  +  Kt 


(147) 


On  multiplying  both  (145)  and  (146)  by  <x(N),  subtracting  (146)  from 
(145),  replacing  <r(]V)  by  <r  •  F(N),  and  solving  for  F(N),  we  find 

In  the  ordinary  case  of  testing  the  mean  of  a  normally  distributed  varia- 
ble, F(N)  is  simply  l/\/N,  and  this  leads  to  N  =  l/a,  where  a  is  defined 
by  (137). 

In  applying  formulas  (147)  and  (148)  to  the  case  of  percent  defective, 
we  have 

z  =  x  +  ks  (149) 

Z  =  ix  +  hr  (150) 

F^  -  Vf  +  2^[)  ^ 

A  lot  is  considered  good  if 

"  >  K1  (152) 


a 
and  bad  if 

^— -  <  K2  (153) 

<j 

where  Ki  >  K2.     From  (152),  using  the  equality  sign,  we  find 

n  =  U  -  Kkt  (154) 
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Substituting  this  into  (150)  gives 

Z1=  U  +  (k  -  Kx)(t  (155) 

Similarly, 

Z2=  U  +  (k  -  K2)a  (156) 

Substituting  from  (151),  (155),  and  (156)  into  (148),  we  have 


4 


N^  2(N  -  1)       Ka  +  KP  l       } 


for  which  (136)  is  the  solution.     Substituting  from  (155)  and  (156)  into 
(147)  gives 

,       Ka[U  +  (k  -  KM  +  KA  U+  (k  -  KM 


Ka  +  Kp 
Ka(U  +  ka)  +Kp(U  +  ka)  _  KaK2  +  KpKi 
Ka  +  K?  Ka  +  K? 


(158) 


A         TT    I    7          KaK2  +  KpKi  ,1tm 

A  =  (7  +  for ^ — : — yt —  °"  (159) 

A«  +  Aj3 

Any  combination  of  values  of  k  and  A  that  satisfy  this  equation  will  be 
satisfactory  values  for  the  multiple  of  the  standard  deviation  and  the 
rejection  level  of  x  +  ks,  but  most  pairs  will  be  unusable  because  calcu- 
lation of  the  rejection  level  A  from  (159)  will  require  knowledge  of  a. 
If,  however,  we  set 

_  KaK2  +  KyKx 
k~       Ka  +  K,  (lb0) 

this  problem  is  solved  by  the  disappearance  of  a,  and  the  rejection  level 
becomes  simply 

A  =  U  (161) 

The  value  of  k  from  (160)  may  then  be  substituted  into  (157)  to  deter- 
mine N.  If  we  replace  N  —  1  by  N,  the  value  of  N  given  by  (157)  is  the 
same  as  that  given  by  formulas  (134)  and  (135),  otherwise  it  is  the  same 
as  that  given  by  (136). 

llcl.4.  Two-sided  Criterion. — The  basis  for  the  two-sided  test  pre- 
sented in  Sec.  10.2.3  is  the  fact  that,  because  x  +  ks  can  be  assumed 
symmetrically  distributed  about  m  +  ka,  the  boundary  between  accept- 
able and  unacceptable  combinations  of  x  and  s  corresponds  with  the 
boundary  between  combinations  of  \i  and  a  for  which  a  lot  has  more  or 
less  than  a  50  percent  chance  of  acceptance.  Thus,  in  Fig.  1.8  if  the  x 
and  s  axes  are  interpreted  as  n  and  a  axes,  any  lot  represented  by  a  point 
on  the  boundary  line  has  an  equal  chance  of  acceptance  or  rejection, 
that  is,  corresponds  with  the  indifference  percent  defective.  For  the 
two-sided  test  it  is  clear  that  the  indifference  quality  is  the  same  as  for  a 
one-sided  test;  for  when  a  is  very  small  and  ji  is  near  U  or  M,  the  two- 
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sided  and  one-sided  tests  become  virtually  equivalent.  Thus,  the  pro- 
cedure for  finding  the  boundary  of  the  two-sided  acceptance  region  begins 
by  finding  the  proper  indifference  quality  po.  For  any  value  of  /z,  a 
value  of  a  is  found  such  that  the  fraction  p'0  below  M  plus  the  fraction  p" 
above  U  is  equal  to  p0;  these  values  of  n  and  <j  are  then  taken  as  the 
coordinates  of  a  point  x,s  on  the  acceptance  boundary.  For  computa- 
tional purposes,  starting  with  fx  and  finding  the  proper  a  is  cumbersome, 
but  the  parametric  equations  (111)  and  (112),  which  start  with  an 
arbitrary  division  of  po  into  two  parts  p'Q  and  p",  are  easily  used.  Given 
pfQ  and  p",  we  have 

n  -  KPaa  =  M  (162) 

and 

ul  +  KP0'a  =  U  (163) 

Equations  (111)  and  (112)  represent  solutions  of  (162)  and  (163)  for  ju 
and  a  in  terms  of  U,  M,  Kp'0,  and  Kp'0'f  and  thus  give  the  coordinates  of  a 
point  x,s  on  the  acceptance  boundary. 

11.2.  Approximations  in  the  Basic  Formulas 

Four  incorrect  assumptions  are  made  in  deriving  the  preceding 
formulas : 

(1)  That  x  +  ks  is  normally  distributed;  actually,  it  is  not  quite 
normally  distributed,  since  s  is  not. 

(2)  That  s  is  distributed  about  a  mean  of  cr;  actually,  it  is  distributed 
about  a  mean  of  aa,  where  a  <  1  and  is  defined  by  equation  (118). 

(3)  That  s  is  distributed  with  standard  deviation  a/\^2(N  —  1); 
actually,  it  is  distributed  with  standard  deviation  of  bv/\/2(N  —  1),  where 
6  <  1  and  is  defined  by  equation  (119). 

(4)  That  s  is  distributed  with  standard  deviation  a/\/2N.  This 
assumption  applies  only  to  formulas  (134)  and  (135)  for  N. 

All  these  assumptions  can  be  avoided  by  utilizing  the  non-central 
t  distribution,  tables  of  which  have  been  published  by  Johnson  and 
Welch.  In  fact,  the  values  of  N  and  k  and  the  corresponding  true  values 
of  pi  and  pi  given  in  Table  1.2  were  computed  with  the  aid  of  tables  of 
the  non-central  t  distribution.  The  effect  of  the  approximations  in  the 
formulas  can  therefore  be  examined  by  comparing  the  values  they  give 
with  those  given  in  Table  1.2.  Such  a  comparison  exaggerates  somewhat 
the  inaccuracy  of  the  formulas  because  in  preparing  the  table  some  effort 
was  expended,  as  is  explained  in  Sec.  11.5,  to  make  the  best  adjustment 
to  the  discontinuous  nature  of  N,  rather  than  entering  the  first  values  of 
N  and  k  found.  Similar  adjustments  could  be  made  in  using  the  for- 
mulas. Section  4.7  includes  a  comparison  between  the  results  given  by 
the  formulas  and  those  in  Table  1.2,  the  comparison  being  made  for  the 
smallest  sample  size,  namely,  6,  included  in  Table  1.2.     (It  will  be  noted 
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that  all  four  of  the  incorrect  assumptions  underlying  the  formulas  rapidly 
become  very  nearly  correct  as  N  increases.) 

TABLE  1.8 

Comparison  of  Approximate  and  Exact  Values  of  N  and  k 
The  exact  values  of  k  in  column  (2)  have  been  computed  from  the  non-central 
t  distribution,  taking  pi  =  0.05,  a  =  0.01,  and  N  as  shown  in  column  (1).  The  exact 
values  of  p2  for  which  /3  =0.10  have  then  been  computed  and  entered  in  column  (3). 
From  these  p2,  taking/?  =  0.10,  pi  =  0.05,  anda  =  0.01,  approximate  values  of  N  and  k 
have  been  computed  from  formulas  (10)  and  (11)  or  (12).  The  true  values  of  py 
and  p2  for  which  a  =  0.01  and  |3  =0.10  have  been  computed  from  the  non-central  t 
distribution. 


Exact 

Approximate 

N 

k 

Vi 

N 

k 

true  pi 

true  pi 

(1) 

(2) 

(3) 

(4) 

(5) 

(6) 

(7) 

5 

0.5445 

0 . 5428 

5 

0.5150 

0.0541 

0.5519 

10 

0.8037 

0.3774 

10 

0.7856 

0.0524 

0 . 3830 

15 

0 . 9292 

0 . 3033 

15 

0.9163 

0.0517 

0.3071 

20 

1.0083 

0.2604 

20 

0 . 9983 

0.0513 

0 . 2633 

25 

1 . 0643 

0.2332 

25 

1.0560 

0.0510 

0 . 2344 

30 

1 . 1069 

0.2119 

30 

1 . 1000 

0.0509 

0.2137 

35 

1 . 1409 

0 . 1966 

35 

1 . 1348 

0.0507 

0.1981 

40 

1 . 1688 

0.1846 

40 

1 . 1633 

0.0507 

0 . 1859 

45 

1 . 1922 

0.1748 

45 

1 . 1874 

0.0506 

0.1759 

50 

1.2125 

0.1666 

50 

1 . 2082 

0 . 0505 

0.1676 

65 

1.2592 

0 . 1489 

64 

1.2556 

0.0501 

0 . 1500 

75 

1.2828 

0 . 1404 

74 

1 . 2797 

0.0501 

0.1413 

100 

1.3264 

0 . 1256 

99 

1.3241 

0.0501 

0.1262 

Table  1.8  shows  a  few  comparisons  of  the  values  of  k  and  N  given 
by  formulas  (132)  and  (134)  or  (135)  with  the  exact  values.  In  these 
cases  a  has  been  fixed  at  0.01  and  pi  at  0.05.  Various  values  of  N  have 
been  selected  and  the  resulting  lot  quality  p2  for  which  (3  =  0.10  has  been 
calculated  from  the  non-central  t  distribution.  Thus  the  exact  values  of 
N  and  k  result  in  an  OC  curve  passing  exactly  through  the  points  defined 
by  pi,a  and  $2,(3.  If  Vha  and  Vifi  are  both  selected  arbitrarily,  it  is  not 
usually  possible  to  find  values  of  N  and  k  such  that  the  OC  curve  passes 
exactly  through  both  points.  Thus  the  exact  values  of  N  and  k  given 
in  Table  1.8  are  more  exact  than  could  ordinarily  be  achieved  in  practical 
problems,  while  those  given  by  the  formulas  are  characteristic  of  the 
accuracy  realizable  in  practice.  Columns  (6)  and  (7)  of  the  table  show 
the  true  values  of  pi  and  p2  for  which  a  =  0.01  and  p  =  0.10  when  the 
approximate  N  and  k  are  used.  The  differences  between  0.05  and  the 
values  in  column  (6)  and  between  the  values  in  columns  (3)  and   (7) 
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are  a  measure  of  the  effect  of  the  approximations.  An  alternative  meas- 
ure would  be  the  error  in  the  true  a  and  /3  at  the  intended  px  and  p2;  this 
error  would  probably  be  smaller  than  that  in  p±  and  p2  because  the  OC 
curves  are  usually  fairly  flat  in  the  neighborhood  of  pi  and  p2.  While  it 
cannot  be  claimed  that  Table  1.8  gives  a  representative  coverage  of  all 
kinds  of  plans,  it  can  be  claimed  that,  for  those  comparisons  given,  the  dis- 
crepancy between  the  approximate  and  exact  results  is  maximized. 

11.3.  Computational  Formulas 

The  use  of  the  acceptance  criterion 

x  +  ks  <  U  (164) 

requires  the  calculation  of  the  sample  mean  x  and  the  standard  deviation 
s.  The  acceptance  criterion  can,  however,  be  expressed  in  terms  of 
simpler  functions  of  the  observations,  namely, 

X  =  2x      (sum  of  the  observations)  (165) 

Y  —  Hx2     (sum  of  the  squares  of  the  observations)     (166) 

If  k  is  positive,  as  it  almost  invariably  will  be,  the  lot  may  be  rejected 
without  regard  to  Y  if 

X  >  NU  >  (167) 

In  this  case,  x  exceeds  U,  so  since  s  cannot  be  negative,  x  +  ks  exceeds  U. 
The  values  of  x  and  5  for  a  sample  of  N  observations  can  be  calculated 

from 

?x 


N  (168) 


and 


-4 


N*x>  -  (&)> 


N(N  -  1) 

In  terms  of  X  and  Y,  equations  (168)  and  (169)  can  be  written 

.       X 


(170) 


and 


4 


NW^D  (171) 

Replacing  x  and  s  in  (164)  by  their  values  from  (170)  and  (171)  gives 


H*^A/        \  i!7L>> 


X  NY  -  X2 

N  +     ^N(N  -  1) 


Subtracting  X/N  from  both  sides  of  inequality  (172)  and  squaring  gives 
NY>  -  X-  2XU      X"- 

n(n  -  i)  k  -  u     nr  +  N*  (173) 
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Clearing  fractions  and  rearranging  terms  gives 

Y  <aX2  -bX  +  c  (174) 

where 

Nk*  +  N  -  1  . 

«  -        Ar^  (175) 

,        2t7(i\T  -  1)  ,1W. 

5  =     m  (176) 

c  =  ^^1  (177) 

If  inequality  (174)  is  fulfilled,  the  lot  is  accepted;  if  inequality  (174)  is 
not  fulfilled,  the  lot  is  rejected. 

Using  the  equality  sign  in  (174)  gives  the  equation  of  a  parabola 

YR  =  aX2  -  bX  +  c  (178) 

which  shows,  for  each  value  of  X,  the  maximum  value  of  Y  permitting 

acceptance,  that  is,  the  upper  boundary  of  the  acceptance  region  or  the 

lower  boundary  of  the  rejection  region. 

The  minimum  value  of  the  curve  YR  gives  a  value  of  Y  below  which 

the  lot  is  acceptable  without  regard  to  X.     Differentiating  (178)  with 

respect  to  X  gives 

dY 

^  =  2'aX  -  b  (179) 

dY 
Setting  -p^  equal  to  zero  and  solving  for  X  gives 
dX 

1  - 1  (18°) 

Since 

55^  =  2a  >  0  (181) 

X  is  the  value  of  X  at  which  the  minimum  value  YR  of  YR  occurs.     Replac- 
ing X  in  (178)  by  the  value  of  X  given  by  (180)  gives,  after  simplification, 

Yr  =  c  -  j£  (182) 

Substituting  the  values  of  a,  b,  and  c  given  by  (175),  (176),  and  (177)  into 
(180)  and  (182),  and  simplifying,  gives 

±  _    UN{N  -  1) 

A  "  Nk*  +  N  -  1  (183) 

*    _    UW(N  -  1) 

Y"  ~  Nki  +  N  _  i  (184> 

v  v 

The  coordinates  of  the  minimum  point,  X  =  X  and  Y  =  YR,  can  be 
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approximated  by  replacing  by  N  the  N  —  1  in  the  denominators  of 

(183)  and  (184) 

Approximate  X  =      7  2  (185) 

jj2(N  —  1) 
Approximate  YR  =  — ,v  (186) 

These  approximate  values  are  smaller  than  the  exact  values  since  they 
were  obtained  by  increasing  the  denominators;  if,  therefore, 

Y  will  certainly  be  less  than  the  minimum  value  of  YR,  and  the  lot  may  be 
accepted  without  regard  to  X. 

The  acceptance  region  has  a  lower  boundary  corresponding,  for  each 
value  of  X,  with  the  minimum  value  cf  Y  algebraically  possible.  The 
sum  of  the  squares  of  the  deviations  of  the  observations  from  their  mean 
can  be  written 

2(z  -  x)2  =  2x2  -  ^-2  (188) 

or,  in  terms  of  X  and  Y 

S(x  -  x)r  =  Y  -~  (189) 


Since  2(z  —  x)2  cannot  be  negative, 


X 


Y>^  (190) 

X2 
For  any  given  sum  of  observations  X,  the  value  of  Y  will  be  -^  when  the 

sum  of  the  squares  of  the  deviations  of  the  observations  from  their  mean 

is  zero;  that  is,  when  all  the  observations  are  equal.     The  lower  boundary 

of  the  acceptance  region,   or  upper  boundary  of  the  error  region,   is, 

therefore, 

X2 
Y*  =  Jf  (191) 

A  sample  point  lying  below  this  line  would  indicate  a  computational 
error. 

By  equating  YR  and  YE  given  by  (178)  and  (191),  it  can  be  shown 
that  the  lower  and  upper  boundaries  of  the  acceptance  region  meet  at 
the  point  X  =  NU  and  Y  —  NU2.  The  two  curves  are  tangent  at  this 
point.  Beyond  it,  they  again  enclose  an  area  similar  to  the  acceptance 
region;  this  area  beyond  the  point  of  tangency  would  be  the  acceptance 
region  for  a  test  in  which  k  is  negative. 
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The  graphical  procedure  of  Sec.  7.3.1  consists  of  drawing  on  a  chart 
the  upper  and  lower  boundaries  of  the  acceptance  region,  given  by  (178) 
and  (191) ;  a  point  representing  the  sample  is  plotted  on  this  chart  and  the 
lot  accepted  or  rejected  according  as  the  point  lies  within  or  above  the 
acceptance  region.  This  graphical  procedure  is  impractical,  however, 
when  the  observations  are  large.  In  this  case,  the  range  of  variation  in  Y 
as  X  varies  is  large  relative  to  the  width  of  the  acceptance  region ;  and  the 
upper  and  lower  boundaries  of  the  acceptance  region  are  too  close  for 
most  practical  purposes. 

An  adjustment  of  the  observations  can  be  made  (a)  to  overcome  this 
difficulty  in  the  graphical  procedure  or  (6)  to  find  a  criterion  more  effective 
than  formula  (187)  in  permitting  immediate  acceptance.  The  adjust- 
ment consists  of  subtracting  an  adjustment  constant  A  from  each  of  the 
observations,  A  being  chosen  according  to  the  purpose  (a)  or  (6)  for 
which  the  adjustment  is  being  used.  It  is  not  necessary  to  adjust  each 
observation;  it  is  necessary  only  to  compute  the  sum  of  the  squares  of 
the  adjusted  observations  Y'  from  X  and  Y  by 

Y'  =  S(X  -  A)2  (192) 

=  Y  -  2AX  +  NA2 

Adjusting  YR  and  YE,  given  in  (178)  and  (191),  by  (192)  (that  is,  sub- 
tracting 2 AX  and  adding  NA2)  gives  the  upper  and  lower  boundaries  of 
the  acceptance  region 

Y'R  =  aX2  -  (6  +  2A)X  +  (c  +  NA2)  (193) 

Y'E  =  ~  -  2AX  +  NA2  (194) 

where  a,  b,  and  c  are  defined  by  (175),  (176),  and  (177). 

V 

It  can  be  shown  that  the  minimum  value  Y'R  of  Y'B  is 

Replacing  by  N  the  N  —  1  in  the  denominator  of  YrR  gives  the  simplified 
criterion  for  immediate  acceptance 

r  <  (JL  -  w  -  i)  (196) 

If  the  graphical  procedure  is  used,  the  adjustment  constant  A  should 
be  chosen  so  as  to  minimize  the  variation  in  Y  as  X  varies  over  the  region 
of  likely  sample  values.  This  flattens  the  curves  defining  the  acceptance 
region,  thus  enabling  the  acceptance  region  to  encompass  a  larger 
portion   of  the   chart's   area   and  thereby  improving  the  accuracy   of 
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plotting.  To  determine  the  optimum  value  of  A,  consider  the  slope  of 
Y'E  at  a  given  value  of  X 

§  -  f  -  2A  (197) 

For  a  given  value  of  X,  the  slope  is  zero  when  A  =  X/N.  The  optimum 
value  of  A  depends,  therefore,  on  the  observed  value  of  X.  If  before 
the  sample  is  inspected  \i'  is  the  best  estimate  of  the  mean  of  the  char- 
acteristic being  inspected,  the  optimum  value  of  A,  for  graphical  purposes, 
is 

A  =  \x'  (198) 

since  the  expected  value  of  X  is  N/jl' '. 

If  the  graphical  procedure  is  not  used,  the  adjustment  constant  A 
should  be  chosen  so  as  to  maximize  the  effectiveness  of  formula  (196)  in 
permitting  immediate  acceptance.  This  involves  finding  the  value  of  A 
that  minimizes  the  Y  corresponding  to  the  minimum  value  Y'  =  Y'R. 
Differentiating  (193)  with  respect  to  X  gives 


dV 

^  =  2aX  -  (b  +  2A)  (199) 


Since 


^jf  =  2a  >  0  (200) 

dYr 
setting  -r~  equal  to  zero  and  solving  for  X  gives  the  value  of  X  at  which 

Y'R  is  a  minimum,  namely, 

xr       b  +  2  A 


2a 


(201) 


The  value  of  Y'R  calculated  from  (193)  for  the  point  whose  abscissa  is 
given  by  (201)  is  the  minimum  value  of  Y'R 

Y'R  =  -  {b  +JA)2  +  (c  +  NA*)  (202) 

V 

The  value  of  YR  given  by  (195)  can  be  obtained  from  (202)  by  replacing 
a,  b,  and  c  by  their  values  given  by  (175),  (176),  and  (177).  Finding  the 
value  of  Y  corresponding  to  Y'  =  Y'R,  that  is,  adding  2 AX  to,  and  sub- 
tracting NA2  from,  (202)  gives 

Y  =  -  (6  +  2A)2  +  (c  +  NA2)  +  2AX  -  NA2  (203) 

=  _(±±2Ay  +  C  +  2AX 

4a 

Now,  the  optimum  value  of  A  is  that  value  which  makes  the  Y  corre- 
sponding to  the  minimum  value  YR  as  large  as  possible,  since,  for  a  given 
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value  of  X,  if  Y  is  less  than  the  value  given  by  (203),  the  lot  is  accepted. 
Differentiating  (203)  with  respect  to  A  gives 

dY=_Q>  +  2A) 

dA  a  K       ' 

Since 

zp.--!<°  (205) 

dY 
setting  -tj  equal  to  zero  and  solving  for  A  gives  the  value  of  A  at  which 

Y  is  a  maximum,  namely, 

A  =  aX  -^  (206) 

The  value  of  A  given  in  (206)  is  a  function  of  X,  so  the  optimum  value  of 
A  depends  on  the  observed  value  of  X.  If  before  the  sample  is  inspected 
//  is  the  best  estimate  of  x,  the  anticipated  value  of  X  is  Np'.  Replacing  X 
in  (206)  by  its  anticipated  value  gives 

A  =  aNn'  -  |  (207) 

Replacing  a  and  b  in  (207)  by  their  values  given  by  (175)  and  (176),  and 

simplifying,  gives 

A        (Nk2  +  N  ~  !V-  W-1)U 

A  -  ^p  (208) 

Replacing  by  N  the  N  —  1  in  the  numerator  of  (208)  gives 

Approximate  A  =  (F  +  ffi'  -  -  (209) 

Using  the  value  of  A  given  by  either  (208)  or  (209)  makes  (196)  very 
nearly  the  most  effective  in  permitting  immediate  acceptance.  In  terms 
of  F,  replacing  the  A  in  (203)  by  (207),  and  simplifying,  gives  the  linear 
condition  for  acceptance 

Y  <  cf  -  b'X  (210) 

where 

b'  =  b-2Nn'a  (211) 

c'  -  c  -  NV2a  (212) 

and  a,  b,  and  c  are  denned  by  (175),  (176),  and  (177). 

11.4.  Use  of  Non-central  t  Distribution 

11.4.1.  Introduction. — The  acceptance  criterion 

x  +  ks  <  U  (213) 

can  be  written 

U  —  x 

>  k  (214) 
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The  quantity  (U  —  x)/.s  is  a  function  of  the  sample  data  and  fc  is  a 
constant;  using  the  acceptance  criterion  (214)  is  thus  equivalent  to  com- 
paring the  value  of  the  sample  statistic  (U  —  x)/s  with  a  minimum  level 
for  acceptance  k. 

If  many  samples  of  N  observations  are  chosen  at  random  from  the 
same  population  (or  lot),  the  values  of  (U  —  x)/s  obtained  from  the 
various  samples  will  vary.  The  proper  determination  of  k  requires  a 
knowledge  of  the  sampling  distribution  of  (U  —  x)/s. 

The  distribution  of  the  quantity  t,  defined  as 


t  =  Vn 


(^J) 


(215) 


is  known  as  the  non-central  t  distribution,  of  which  the  familiar  Student's 
t  distribution  is  a  special  case  corresponding  with  Ex  =  U.  Tables  of  the 
non-central  t  distribution  with  an  explanation  of  their  application  to  the 
problems  considered  in  this  chapter  have  been  given  by  Johnson  and 
Welch. 

11.4.2.  Notation  Used  in  the  Tables. — It  will  be  necessary  to  use  the 
notation  of  Johnson  and  Welch  to  explain  the  application  of  their  tables 
to  the  problems  discussed  in  this  chapter.  The  following  list  indicates 
the  changes  in  notation:* 


Value  of  variable  that  defines  quality 

Sample  statistic 

Acceptable  level  of  statistic 

Normal  deviate  corresponding  with  proportion 

p  defective  in  lot 

Normal  deviate  corresponding  with  probability  e 

of  accepting  a  lot 


Johnson  and  Welch 


L 

L  -  x 

s 

u 

K 


*  A  word  of  explanation  is  in  order  about  our  departure  from  the  Johnson- Welch 
and  Jennett- Welch  notation,  since  their  notation  is  satisfactory  and  established. 
Throughout  all  three  of  our  volumes  the  letter  L  is  consistently  used  to  represent  the 
ordinate  of  an  OC  curve;  as  a  substitute  U  has  the  mnemonic  advantage  of  association 
with  "upper  limit."  It  seems  unnecessary  to  use  two  symbols  for  the  normal  deviate, 
since  the  type  of  subscript  (Greek  letter,  numeral,  and  so  forth)  can  discriminate 
between  normal  deviates  defining  quality  and  those  describing  operating  charac- 
teristics. For  the  sample  statistic,  k  accords  with  the  lot  parameter  K.  Logic  suggests 
a  subscript  for  the  acceptable  level  of  the  statistic,  and  Johnson  and  Welch  use  0;'but 
because  the  acceptable  level  enters  the  analysis  so  frequently  and  other  values  so 
seldom,  we  use  no  subscript.  Logic  also  suggests  a  subscript  for  a  particular  observed 
value  of  the  statistic,  and  we  use  o. 
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In  the  notation  of  Johnson  and  Welch,  the  acceptance  criterion  (214) 
becomes 

u  >  uQ  (216) 

Let  z  be  a  quantity  distributed  normally  about  zero  with  unit  standard 
deviation,  and  let  w  be  a  quantity  distributed  independently  of  z  and 
in  the  same  way  as  an  estimate  of  variance  computed  from  a  sample 
from  a  normal  population  containing  /  degrees  of  freedom.  That  is,  w 
is  distributed  as  x2//  with  /  degrees  of  freedom.     If  t  is  defined  by 

t  =  ^  (217) 

V  w 

where  8  is  some  constant,  then  t  has  the  non-central  t  distribution, 
depending  only  on  the  eccentricity  8  and  the  number  of  degrees  $f  freedom 

/• 

To  see  that  the  statistic  u  is  closely  related  to  the  non-central  t  distri- 
bution, consider  the  distribution  of  values  of  u  from  random  samples 
of  N  observations  from  a  normal  population  with  mean  n  and  standard 
deviation  a.  Suppose  that  exactly  a  proportion  p  of  the  items  in  the 
population  exceed  a  given  limit  L  and  that  U  is  the  standardized  normal 
deviate  exceeded  with  probability  p,  that  is, 

^/V'*r-.*  (218) 

Let  x  and  s  be  the  sample  mean  and  standard  deviation,  respectively 
Then,  in  the  notation  of  Johnson  and  Welch,  y/N  u  can  be  written  as 
(217)  with 

z  =  ~  V^(*  ~  U)  (219) 

w  =  -'  (220) 

a 

f  =  N  -  1  (221) 

8  =  VN  U  (222) 

Hence,  \/N  u  is  distributed  as  a  non-central  t  with  /  =  N  —  1  and 
8  =  y/N  U. 

The  distribution  of  non-central  t  requires  a  triple-entry  table,  since 
the  probability  that  t  exceeds  a  given  level  to  depends  on  /,  8,  and  20. 
This  probability  is  denoted  by  P(f,8,t0).  Johnson  and  Welch  also  use 
t(f,8,e)  to  denote  that  value  of  t0  for  which  the  probability  P(f,8,t0) 
takes  a  specified  value  e.  The  value  of  8  that  makes  P(f,8,t0)  =  e  is 
denoted  by  8(f,U,e). 

11.4.3.  Description  of  the  Tables. — The  problems  for  which  the  non- 
central  t  tables  are  used  in  this  chapter  require  the  evaluation  of  either 
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5(f,to,e)  or  t(f,5,e).     Table  IV  given  by  Johnson  and  Welch  facilitates  the 
direct  calculation  of  these  for  17  probability  levels  corresponding  with  e 
and  1-e  equal  to  0.005,  0.01,  and  0.025,  0.05,  0.1,  0.2,  0.3,  0.4,  and  0.5. 
In  Table  IV,  a  quantity  X  =  \(f,t0,e)  is  given  such  that 


8(f,to,e)  =t0-\^Jl+^f  (223) 

and 


«52        X 


s  +  x  Ji  h ;, 


t(fM  =  Jf       2f  (224) 

1  —  — 
2/ 

Equations  (42)  and  (21)  of  this  chapter  are  essentially  approximations  for 
equations  (223)  and  (224),  respectively,  with  X  in  (223)  and  (224) 
replaced  by  K€,  the  normal  deviate  exceeded  with  probability  e.  Even 
for  small  samples,  it  is  found  that  X  does  not  differ  much  from  Ke. 

Table  IV  consists  of  nine  double-entry  panels  (one  for  each  of  the 
values  of  e  listed  above)  showing  X  as  a  function  of  /  and  t0.  Values  of  X 
are  given  corresponding  to  /  =  4,  5,  6,  7,  8,  9,  16,  36,  144,  and  oo.  In 
order  to  simplify  interpolation  for  intermediate  values  of/  greater  than  9, 
the  values  of/  =  9,  16,  36,  144  and  oo  are  such  that  12/V7  =  4,  3,  2,  1, 
and  0,  respectively;  that  is,  X  is  tabled  at  equal  intervals  of  12/ vT- 
The  whole  range  of  t0  from  —  oo  to  oo  is  covered  as  follows:  For  t0/\//2f 
between  —  oo  and  —0.75  or  between  +0.75  and  +  oo,  X  is  tabled  against 
the  quantity 

v  =  -r=  (225) 

For  t0/\/2f  between  —0.75  and  +0.75,  X  is  tabled  against 

V'  -   ~#L  (226) 

tr 


V1 4  - 


2/ 

11.4.4.  Calculation  of  8(f,to,e). — For  given/,  t0,  and  e  <  0.5,  the  steps 
required  for  the  calculation  of  8(f,t0,e)  are  given  below.  For  e  >  0.5, 
the  relation 

fi(/,«o,e)   =    -5(/,-M  -e)  (227) 

is  used.  That  is,  if  6(9,12,0.95)  is  required,  calculate  5(9,-12,0.05) 
by  the  procedure  outlined  below  and  take  the  negative  of  this  value  as  the 
required  value  of  5(9,12,0.95). 
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Step  1. 
Step  2. 


Calculate  U/y/2f. 

Find   2/ 

£o 


Vl  +  tl/2f 


or   */'  = 


to/y/2f 

Vi  +  *2/2/ 


according  as 


Step  3. 
£tep  4. 


/Step  5. 


is  greater  than  or  less  than  0.75. 
calculate  12/  \//- 


V2/ 

If  /  is  greater  than 

From  the  panel  of  Table  IV  corresponding  with  the  given  e, 
obtain   X  =  \(f,to,e)   by  interpolating  with  respect   to   the 
quantities  obtained  in  steps  2  and  3. 
Calculate  8(f,U,e)  =  t0  -  \  Vl  +  4/2f. 

11.4.5.  Calculation  of  t(f,8,e). — The  calculation  of  t(f,8,e)  is  accom- 
plished by  an  iterative  method,  since  X  is  given  in  Table  IV  as  a  function 
of  £0.  The  successive  steps  in  the  iterative  method  for  calculating  t(f,8,e) 
for  given  /,  8,  and  e  <  0.5  are  given  below.     For  e  >  0.5,  the  relation 

t(f,8,e)  =  -t(f,-8,l  -e)  (228) 

is  used.  That  is,  if  £(9,5,0.90)  is  required,  calculate  £(9,-5,0.10)  by 
the  procedure  outlined  below,  and  take  the  negative  of  this  value  as  the 
required  £(9,5,0.90). 

Step  1.     Determine  K e,  the  normal  deviate  exceeded  with  probability 
e,  which  can  be  found  in  Table  1.1. 
Calculate  the  first  approximation  t\  from 


Step  2. 


5  + 


K.yj 


1  +  2T- 


2/ 


1  -^ 


Step  3. 
Step  4. 


Step  5. 
Step  6. 


Step  7. 


Calculate  h/V^f. 
Find    y  = 


2/ 


or    y 


h 


ti/V2f 
Vl  +  t\/2f   "'     7         Vl  +*!/2/ 
is  greater  than  or  less  than  0.75. 


according   as 


V2f 

If/  is  greater  than  9,  calculate  12/V/- 

From  the  panel  of  Table  IV  corresponding  with  the  given 

e,  obtain  Xi  =  \(f,the)  by  interpolating  with  respect  to  the 

quantities  obtained  in  steps  4  and  5. 

Calculate  a  second  approximation  £2  from 


5  +  X 


0-1) 


^2/       2/ 
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Step  8.  Repeat  steps  3  through  7,  replacing  ti  by  t2,  and  calculate  a 
third  approximation  ts.  Repeat  these  steps  until  two  suc- 
cessive approximations  are  the  same;  usually  not  more  than 
two  approximations  are  required.  Take  for  t(f,8,e)  the  value 
so  obtained. 

11.4.6.  Illustration  of  the  Use  of  the  Non-central  t  Tables  in  Specific 
Problems 

Problem  I.     Given:  N,  ph  e   (in  our  notation,  N,  ph   1  —  a  or 
N,  p2,  (3) 
Required:  u0  (in  our  notation,  k) 

A  single-sampling  plan  based  on  variables  is  defined  by  a  sample  size 
N  and  a  value  u0  of  the  statistic  u.  If  N  is  fixed,  u0  can  be  determined  so 
that  the  operating-characteristic  curve  for  the  plan  passes  through  a 
specified  point  (pi,e);  that  is,  for  a  fixed  sample  size,  a  plan  is  chosen  for 
which  the  probability  of  accepting  lots  of  quality  px  is  e. 

For  given  N,  pi,  and  e,  the  value  u0  that  defines  the  plan  is  given,  in 
the  notation  of  Johnson  and  Welch,  by 

VNuo  =  t(N  -  l,\/NUhe)  (229) 

where  V\  is  the  normal  deviate  exceeded  with  probability  px.  Thus,  for 
a  given  sample  size  N,  quality  (fraction  defective)  ph  and  probability 
of  acceptance  e,  the  determination  of  u0  requires  the  calculation  of 
t(f,8,e)  where/  =  N  -  1  and  5  =  y/NUx. 

An  outline  of  the  method  for  determining  u0  for  given  values  of  N, 
pi  and  e  is  given  below. 
Given:     N  =  sample  size 

pi  =  a  proportion  defining  quality 
e  =  desired  probability  of  accepting  lots  of  quality  pi 

Step  1.     Determine  Uh  the  normal  deviate  exceeded  with  probability 

pi,  from  Table  1.1. 
Step  2.     Calculate 

f=N-  1 

Step  3.     Calculate  t(f,8,e)  by  the  method  described  in  Sec.  11.4.5. 
Step  4.     Calculate  uQ  =  t(f,8,e)/\/N. 

Problem  II.     Given:  N,  u0,  e  (in  our  notation,  N,  k,  Lp) 
Required:  p 

For  every  single-sampling  plan  (defined  by  N  and  u0),  there  is  an 
operating-characteristic   (OC)   curve  which  shows  the  probability  e  of 
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accepting  a  lot  of  quality  p.  From  the  Johnson  and  Welch  tables  we  can 
compute  the  values  of  p  corresponding  to  e  or  1  —  e  equal  to  0.005,  0.01, 
0.025,  0.05,  0.1,  0.2,  0.3,  0.4,  or  0.5.  This  requires  finding  5(f,t0,e)  where 
/=  N  —  1  and  t0  =  s/Nuo.  The  value  of  U  corresponding  to  this  value 
of  8  is  b/y/N.  The  value  of  p  corresponding  to  the  given  e  is  then  the 
probability  that  a  normal  deviate  will  exceed  this  U,  which  may  be  found 
from  Table  1.5. 


Step  1.     Calculate 


t0  =  \^NuQ 
f=N-l 


Step  2.     Calculate  d(f,t0,e)  by  the  method  described  in  Sec.  11.4.4. 
Step  3.     Calculate  U  =  5(f,t0,e)/VN. 

Step  4.     Obtain  the  required  p  by  entering  Table  1.5  with  K  equal  to 
the  value  of  U  found  in  step  3. 

11.4.7.  Work  Sheets. — Suppose  that  the  sample  size  is  fixed  at  30, 
and  a  plan  is  required  for  which  a  lot  having  0.1  percent  defective  will 
be  accepted  with  probability  0.99.  That  is,  N  =  30,  pi  =  0.001,  and 
€  =  0.99.  Work  sheets  1,  2,  and  3  (Tables  1.9,  1.10,  and  1.11)  represent 
the  calculations  necessary  to  determine  u0;  work  sheet  4  (Table  1.12) 
represents  those  for  the  OC  curve.  The  numerical  entries  have  been 
rounded  from  calculations  originally  carried  to  more  decimal  places,  so 
they  do  not  always  agree  exactly  with  operations  carried  out  on  the  5-deci- 
mal-place  figures  shown. 


TABLE  1.9 
Work  Sheet  1  for  Determining  u0  (in  Our  Notation  k)  from  Non-central  t 

Tables 


Row  number 

Operation 

Result 

Example 

(1) 

(2) 

(3) 

(4) 

(1) 

Given 

Vi 

0.001 

(2) 

Given 

e 

0.99 

(3) 

Given 

N 

30. 

(4) 

Table  1.1 

Ux 

3.09023 

(5) 

V(3) 

Vn 

5.47723 

(6) 

(3)  -1 

f  =  N  -  1 

29. 

(7) 

(4)  X  (5) 

8     =     y/N    Ul 

16.92590 

(8) 

Sheet  2,  row  (14), 
last  column 

t(fXe) 

12.47223 

(9) 

(8)  +  (5) 

u0  =    KJJ  ' 

2.27711 
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TABLE  1.10 
Work  Sheet  2  for  Determining  u0 — Computation  of  t 


(SM 


Row 
number 

(1) 

Operation 

(2) 

Result 

(3) 

First 
approximation 

(4) 

Second 
approximation 

(5) 

(1) 
(2) 
(3) 
(4) 
(5) 

Sheet  1,  line  6 
Sheet  1,  line  7* 
Sheet  1,  line  2* 
2  X  (1) 

(2)2 

/ 

8  or  —  8 

e  or  1  —  e 

2/ 

82 

29 
-16.92590 
0.01 

58 
286.48606 

< — 

(6) 

(5)  +  (4) 

52 
2/ 

4.93941 

«- 

(7) 
(8) 

Table  1.1 

(7)2 

Ke2 

2.32635 
5.41190 

2.321f 
5 . 38704 

(9) 

(8)  +  (4) 

2/ 

a2       Ke2 

1  +  2f~^r 

0.09331 

0.09288 

(10) 

1  +  (6)  -  (9) 

5.84611 

5 . 84654 

(11) 

vW) 

j         52       Ke2 

V1+2/~^T 

2.41787 

2.41796 

(12) 

(2)  +  (7)  X  (11) 

/         82      Ke2 

-11.30109 

-11.31381 

(13) 

1  -  (9) 

2/ 

0.90669 

0.90712 

(14) 
(15) 

(12)  -4-  (13) 

VW) 

t  =  t(f,8,e) 
V2f 

-12.46410 
7.61577 

-12.47223 

(16) 

(14)  ■*■  (15) 

t 

VW 

- 1 . 63662 

-1.63768 

(17) 

(14)2 

t2 

155.35379 

155.55660 

(18) 

(17)  +  (4) 

t2 
2/ 

2.67851 

2.68201 

(19) 

1  +  (18) 

t2 
i+2f 

3.67851 

3.68201 

(20) 

VW) 

V1+J 

1.91795 

1.91886 

(21) 

1  -4-  (20) 

(16)  4-  (20) 

VW) 

12  -=-  (23) 

"  =  1/V1+J 

0.52139 

% 
5.38516 

2.22834 

0.52114 

(22) 
(23) 
(24) 

v7 

12 

V? 

t 

(25) 

Sheet  3,  row  (17), 
last  column 

X 

2.321 

2.321 

*  Since  e  >  0.5,  we  change  the  sign  of  5  and  replace  «  by  1  —  e. 

t  The  figure  in  row  (7),  column  (5)  is  the  value  of  Xi  given  in  row  (25),  column  (4)  and  is  used  in 
place  of  Ke  in  the  succeeding  calculations  for  X2. 

t  No  figures  are  given  for  y'  in  this  example  because  to/-\/2f  >  0.75. 


Sec.  11.4.7 


VARIABLES 


79 


TABLE  1.11 
Work  Sheet  3  for  Determining  u0 — Linear  Interpolation  for  \(a,b) 


Row 
number 

Operation 

Result 

First 
approximation 

Second 
approximation 

(1) 

(2) 

(3) 

(4) 

(5) 

(1) 

Sheet  2,  line  21* 

a 

0.52139 

0.52114 

(2) 

Sheet  2,  line  24* 

b 

2.2283 

2 . 2283 

(3) 

Table  IV* 

tti 

0.5 

0.5 

(4) 

Table  IV* 

a-2 

0.6 

0.6 

(5) 

Table  IV* 

&i 

3. 

3. 

(6) 

Table  IV* 

62 

2. 

2. 

(7) 

(3)  -  (1) 
(3)  -  (4) 

(i\  —  a 

Ci   =  

cl\  —  a2 

0.2139 

0.2114 

(8) 

(1)  -  (4) 
(3)  -  (4) 

a  —  a2 
c2  = 

d\   —  <22 

0.7861 

0 . 7886 

(9) 

(5)  -  (2) 
(5)  -  (6) 

61-6 

di=b1-  &2 

0.7717 

0.7717 

(10) 

(2)  -  (6) 
(5)  -  (6) 

,         6  -&, 
d2  =  fc  -  b2 

0.2283 

0.2283 

(11) 

Table  IV* 

Xn 

2.318 

<_ 

(12) 

Table  IV* 

X12 

2.324 

< — 

(13) 

Table  IV* 

x21 

2.310 

^_ 

(14) 

Table  IV* 

x22 

2.318 

^_ 

(15) 

(8)(11)  +  (7) (13) 

C2X11  +  C1X21 

2.31629 

2.31631 

(16) 

(8) (12)  +  (7) (14) 

C2X12  +  CiX22 

2.32272 

2 . 32273 

(17) 

(10)(15)  +  (9)(16) 

\(ab) 

=  dt(ci\u  +  C1X21) 
+  d}(c2\i2  +  CiX22) 

2.321 

2.321 

*  For  the  given  e,  X  is  a  function  of /and  to  and  is  given  in  Johnson  and  Welch's  Table  IV,  for  specified 
values  of  y  (or  y')  and/  (or  12/ a//)-  a  and  b  are  the  values  of  the  arguments  (in  this  case  y  and  12/ V?. 
respectively)  for  which  X  is  required,  ai,  at,  &i,  and  62  are  the  values  of  the  arguments  for  which  Table 
IV  gives  X  values  and  between  which  we  interpolate  to  find  X(o,6).  Xn,  X12,  X21,  and  X22  are  the  tabular 
values  corresponding  to  at,  a<i,  61,  and  62.      The  following  scheme  explains  the  notation; 


bi 

b2 

a\ 

Xn 

X12 

at 

X21 

X22 

The  computation  of  the  OC  curve  for  the  above  plan  is  illustrated  by 
finding  the  values  of  p  for  which  Lp  is  0.2  and  0.7.  The  figures  given  in 
Table  1.12  represent  the  calculations  necessary  to  determine  the  values  of 
p  corresponding  with  these  two  values  of  Lp.  Additional  columns  would 
be  required  for  additional  points  on  the  OC  curve. 
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TABLE  1.12 

Work  Sheet  4  for  Operating-characteristic  Curve 


(1) 

(2) 

(3) 

(4) 

(5) 

Row 

number 

Operation 

Result 

First  point 

Second  point 

(1) 

Given 

LP 

0.2 

0.7 

(2) 

Smaller  of  Lp  and 
1  -  Lp 

€ 

0.2 

0.3 

(3) 

Sheet  3,  line  1,  last 
column 

a 

0.52114 

*~ 

(4) 

Sheet  3,  line  2,  last 
column 

b 

2 . 2283 

<_ 

(5) 

Sheet  3,  row  3* 

di 

0.6 

0  5 

(6) 

Sheet  3,  row  4* 

do 

0.5 

0.6 

(7) 

Sheet  3,  row  5 

bx 

3. 

<- 

(8) 

Sheet  3,  row  6 

b2 

2. 

«- 

(9) 

(5)  -  (3) 
(5)  -  (6) 

d\  —  a 

0 . 7886 

0  2114 

a\  —  a-2 

(10) 

(3)  -  (6) 
(5)  -  (6) 

a  —  a* 

0.2114 

0.7886 

d\    —    d% 

(ID 

(7)  -  (4) 
(7)  -  (8) 

dl=bt-    b2 

0.7717 

- 

(12) 

(4)  -  (8) 

(7)  -  (8) 

,               b    ~b2 

Oi   —  02 

0.2283 

- 

(13) 

Table  IV  f 

Xn 

0.9125 

0.4318 

(14) 

Table  IV  f 

Xl2 

0.8899 

0.4632 

(15) 

Table  IV  f 

X21 

0.9192 

0.4408 

(16) 

Table  IV  t 

X22 

0.8945 

0.4691 

(17) 

(10) (13)  +  (9) (15) 

C2X1I    +   C1X21 

0.91778 

0.43370 

(18) 

(10)(14)  +  (9)(16) 

C2X12    +   C1X22 

0.89353 

0.46445 

(19) 

(12)(17)  +  (11)(18) 

X(o6) 

=  d2(c2Xii  +  C1X21) 

+  C?i(c2Xi2    +  C1X22) 

0.8991 

0.4574 

(20) 

Sheet  2,  row  14,  last 

column  f 
Sheet  2,  row  20,  last 

column 

h 

12.47223 
1 . 91886 

-12.47223 

(21) 

V-+i 

- 

(22) 

(20)  -  (19)  X  (21) 

..-^■+1 

10.74699 

-13.34992 

(23) 

(22)  if  e  =  Lp 

-  (22)  if  e  =  1  -  Lp 

5 

10.74699 

13.34992 

(24) 

Sheet  1,  row  5 

\/N 

5.47723 

*- 

(25) 

(23)  -*■  (24) 

U=^ 

1.96212 

2.43735 

(26) 

Table  1.5 

V 

0 . 0249 

0.0074 

*  The  interchange  of  ai  and  a-i  in  column  (4)  is  made  for  consistency  with  the  interpolation  scheme  of 
sheet  3. 

t  Use  the  panel  of  Table  IV  that  corresponds  with  the  value  of  e  given  in  row  (2).  Note  that  the 
portion  of  Table  IV  where  t  is  positive  is  used  for  column  (4)  and  the  portion  where  t  is  negative  is  used 
for  column  (5). 

If  only  N  and  uo  are  given  and  no  preliminary  calculations  are  available  from  work  sheets  for 
determining  tto  by  the  exact  method,  then  sheet  4  must  be  supplemented  by  a  sheet  consisting  of  rows 
(14)  through  (24)  of  sheet  2,  with  t  taken  as  y/~N  m  and /taken  as  N  —  1. 
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11.5.  Construction  of  Table  1.2 

Problem.     Given:  ph  p2,  a,  and  0 
Required:  N  and  k 

A  single-sampling  plan  based  on  variables  is  denned  by  a  sample 
size  N  and  a  value  k,  the  multiple  of  the  standard  deviation  in  the  criterion 
x  +  ks  <  U  for  accepting  the  lot,  where  U  is  the  value  above  which  the 
characteristic  being  inspected  is  regarded  as  defective.  If  no  restrictions 
are  imposed  on  sample  size,  a  plan  can  be  constructed  for  which  the 
operating-characteristic  curve  will  pass  through  two  specified  points.  In 
practice,  the  first  point  often  corresponds  with  a  lot  quality  pi  (AQL)  for 
which  it  is  desirable  that  the  probability  of  rejection  should  be  only  a 
(producer's  risk),  and  the  second  point  corresponds  with  a  lot  quality 
Pi  (LTPD)  for  which  it  is  desirable  that  the  probability  of  acceptance 
should  be  only  /3  (consumer's  risk). 

Consider  the  family  of  all  OC  curves  that  pass  through  the  point 
(pi,l  —  a)  but  which  have  differing  values  of  N.  If  N  is  small  these 
curves  decline  gradually  as  p  increases  from  ph  and  if  N  is  large  they 
decline  sharply.  There  will  be  one  OC  curve  among  the  family  that 
passes  closer  than  any  other  curve  to  the  point  (p2,/3).  Since  N  takes 
only  integral  values,  there  usually  will  be  no  curve  that  passes  exactly 
through  (p2,(3).  A  curve  could,  however,  be  made  to  pass  exactly  through 
(p2,P)  if  it  were  allowed  to  deviate  from  (pi,l  —  a).  Table  1.2  has  been 
constructed  in  such  a  way  that  the  OC  curves  generally  pass  not  exactly 
through  either  (pi,l  —  a)  or  (p2)j3)  but  very  close  to  both.  The  distance 
of  the  OC  curve  from  the  intended  two  points  has  been  measured  hori- 
zontally, that  is,  by  the  difference  between  the  actual  and  intended 
qualities  for  which  the  probabilities  of  acceptance  are  1  —  a  and  (3. 

The  values  of  N  and  k  in  Table  1.2  were  obtained  in  three  stages: 
(1)  obtaining  a  first  approximation  to  N  and  k;  (2)  determining  the  degree 
to  which  the  preliminary  values  of  N  and  k  satisfy  the  requirement  that 
the  OC  curve  pass  through  (pi,l  —  a)  and  (p2,  |S);  and  (3)  adjusting  N 
or  k  to  improve  the  approximation. 

Step  1.  From  ph  p2,  a,  and  (3,  determine  Kh  K2,  Ka,  and  Kp,  the 
normal  deviates  exceeded  with  probabilities  ph  p2,  a,  and  /3, 
respectively,  as  given  by  Table  1.1. 

Step  2.     Calculate  the  first  approximation  to  k  from 

7  _  KaK2  -f-  KjiKi 
Ka  +  Kp 

Step  3.     Calculate  an  approximate  sample  size  N  from 

k2  +  2 


N 


(Kg     +    KjV 

VKi  -  kJ 
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Step  4.  By  the  method  given  in  Sec.  11.4  for  determining  the  quality 
p  corresponding  to  a  specified  probability  of  acceptance  Lp, 
determine  pi-a  and  p$  where 

Pi-a  =  the  quality  for  which  the  probability  of  acceptance 

is  1  —  a 
pp      =  the  quality  for  which  the  probability  of  acceptance 
is  (3 
If  p^a  and  pp  are  sufficiently  close  to  pi  and  p2,  respectively, 
then  the  values  of  k  and  N  given  by  steps  2  and  3  define  a 
satisfactory  sampling  plan. 
Step  5.     If  the  discrepancy  between  pi_a  and  pi  or  that  between  p$  and 
pi  is  too  large,  determine  a  better  plan  by  adjusting  either  N 
or  k  according  to  the  following  rules: 

a.  If  pi-a  is  greater  than  pi  and  p$  is  less  than  p2,  take  the 
next  lower  integer  for  N  and  the  same  k,  and  repeat  step  4. 

b.  If  pi-a  is  less  than  pi  and  p$  is  greater  than  p2,  take  the 
next  higher  integer  for  N  and  the  same  k,  and  repeat 
step  4. 

c.  If  pi-a  is  less  than  p±  and  p$  is  less  than  p2,  use  the  same 
N  and  a  smaller  value  of  k,  and  repeat  step  6. 

d.  If  pi_<*  is  greater  than  p\  and  p^  is  greater  than  p2,  use 
the  same  N  and  a  larger  value  of  k,  and  repeat  step  6. 

These  rules  can  be  applied  regardless  of  the  method  used  for  finding 
the  first  approximation  to  k  and  N.  When  these  are  found  by  steps  2  and 
3,  however,  rule  d  will  usually  be  applicable. 

Calculate  the  better  approximation  for  k  mentioned  in  (c)  and  (d) 
from 

,   _  \\K2  -f  X2if  i 

X!   +  X2 

where 

Xi  =  value  of  X  computed  in  determining  pi-a  in  step  4 
X2  =  value  of  X  computed  in  determining  p$  in  step  4 

Step  6.  For  the  required  N  and  k,  take  the  pair  of  values  for  which 
pi_a  and  pp  are  close  enough,  or  closest,  to  the  given  values  of 
pi  and  p2,  respectively. 

Table  1.2,  which  gives  values  of  N  and  k  for  various  combinations  of 
pi  and  p2  when  a  =  0.05  and  (3  —  0.10,  was  constructed  by  essentially 
the  method  described  above.  For  each  plan  in  Table  1.2,  except  that 
for  pi  =  0.05  and  p2  =  0.40,  the  true  values  of  pi  and  p2  agree  exactly 
with  the  intended  values  when  rounded  to  the  number  of  decimal  places 
with  which  the  intended  pi  and  p2  are  given  in  the  table;  in  many  cases 
the  agreement  is  much  better.  Minor  improvements  could,  no  doubt,  be 
made  in  many  cases. 
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11.6.  Sequential  Analysis  of  Proportion  Defective  by  Variables* 

The  fact  that 

z  =  x  +  ks  (230) 

can  be  taken  as  approximately  normally  distributed  with  mean  and 
standard  deviation  approximately 

Z  =  M  +  for  (231) 

and 

a(N)  =  aF(N)  (232) 

where 


F(N)  =  ^l  +  o/A;     ix 


ft2 

AT    '  -2(JV  -  1) 


may  be  made  the  basis  of  a  sequential  procedure  for  controlling  percent 
defective  by  means  of  variables. 

For  a  lot  of  acceptable  quality,  that  is,  one  in  which  the  mean  of  z 
is  Zi  as  defined  by  (155),  the  probability  density  of  z  is  given  by 

P(z\Zi)  =  *    ,-  e~^  (234) 

<r(iV)  V27T 

Similarly,  for  a  lot  of  unacceptable  quality,  that  is,  one  in  which  the  mean 
of  z  is  Z2  as  defined  by  (156),  the  probability  density  of  z  is 

P(z\Z2)  =    — i  <T^W  (235) 

cr(iV)  V2tt 

A  sequential  test  is  made  by  computing  X  after  each  observation, 
where 

X  =  log,|g|)  (236) 

X  >  a  (237) 


and  rejecting  the  lot  if 

accepting  it  if 

X  <  -b  (238) 

*  See  Chap.  17  for  a  brief  account  of  sequential  analysis  and  for  references  to  the 
fundamental  work  on  this  subject,  the  theory  of  which  was  developed  in  the  Statistical 
Research  Group  by  A.  Wald.  An  exact  treatment  of  proportion  defective  problems  by 
sequential  analysis  of  variables  involves  the  ratio  of  densities  of  two  non-central 
t  distributions.  Henry  Goldberg  has  shown  that  this  may  be  expressed  in  terms  of  a 
confluent  hypergeometric  function,  tables  of  which  were  prepared  for  the  Statistical 
Research  Group  by  the  New  York  Mathematical  Tables  Project  in  a  form  suitable 
for  use  in  sequential  analysis.  There  is  in  preparation  an  additional  section  for 
Sequential  Analysis  that  will  include  these  tables  and  an  explanation  of  their  use. 
The  present  section  simply  outlines  a  sequential  procedure  based  on  the  approxima- 
tions used  elsewhere  in  this  chapter  for  single  sampling. 
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or  taking  another  observation  if 

-b  <  X  <  a  (239) 

where 

a  =  logei^  (240) 

a 

b  =  \oge~^  (241) 

a  being  the  probability  of  rejecting  a  lot  of  quality  Z\  and  /3  the  proba- 
bility of  accepting  a  lot  of  quality  Z2.  Each  calculation  of  X  is  based  on 
all  observations. 

To  obtain  X,  we  take  the  logarithm  of  (234)  and  subtract  it  from  the 
logarithm  of  (235),  which  yields 

x  -  2TOT [i*  - Zi)2  - (z  -  ^  =  zwm  (2  -  H^2)  (242) 

Substituting  for  Zx  and  Z2  from  (155)  and  (156)  and  for  *(N)  from  (232), 
and  rearranging,  gives 


x  =  wmw  r  ~ l  ~ "  v  — 2~~7. 


which  may  be  rewritten  as 

r  \[F(iV)P  Xt  +  JT, 

«  -  °.  +  »  [if:  -  X,  2 

This  will  be  independent  of  a  if 


+  fc 


(243) 


(244) 


k  =  £+*■  -  ^ffl  (245) 

On  substituting  for  F(JV)  from  (233),  there  results  a  quadratic  in  h  for 
which  the  solution  is 

h  =  Vl-AB-l  (246) 

where 

(247) 


(.V  -  l)(Kt  -  K2) 
and 

B  =  iv^f-  gg  -  <*  +  *<>  (248) 

For  each  value  of  N,  two  values  of  fc  are  obtained  from  (246),  the 
first,  kR,  by  replacing  X  in  (247)  and  (248)  by  a,  and  the  second,  kA,  by 
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replacing  X  by  —b,  where  a  and  b  are  given  by  (240)  and  (241).*  Then 
after  each  observation  both 

zB  =  x  +  kRs  (249) 

and 

zA  =  x  +  kAs  (250) 

are  computed.  If  zR  >  U  the  lot  is  rejected.  If  Za  <  U  the  lot  is 
accepted.  If  neither  condition  is  fulfilled,  another  item  is  inspected, 
new  values  of  x,  s,  kR,  and  kA  are  computed  from  all  the  observations,  and 
new  values  of  zR  and  zA  are  compared  with  U. 

An  alternative  computational  procedure  is  to  compute  k0  after  each 
observation,  where 

K  =  ^— *  (251) 

s 

then  if  &<,  <  /c^  the  lot  is  accepted,  if  k0  >  kR  the  lot  is  rejected,  and  if 
kA  <  kQ  <  kR  another  item  is  inspected. 

The  values  of  kA  and  kR  depend  upon  N,  pi,  p<i,  a,  and  /8.  In  any  one 
problem  all  of  these  but  N  are  fixed,  so  a  table  showing  kA  and  kR  for  various 
values  of  N  can  be  prepared  in  advance.  Ifpi  =  0.15,  p2  =  0.30,  a  =  0.01 
(3  =  0.02,  and  N  =  95,  the  values  of  kR  and  fcA  are  0.6651  and  0.8930, 
respectively,  f  For  a  single-sampling  plan,  the  corresponding  value  o 
k  was  found  in  Sec.  4.2.2  to  be  0.7645.  For  the  sequential  plan,  if  no 
decision  had  been  reached  from  the  first  94  observations,  rejection  would 
occur  if  for  the  first  95  observations  x  +  0.6651s  >  U  and  acceptance 
would  occur  if  x  +  0.8930  <  U;  if  neither  condition  were  fulfilled,  another 
observation  would  be  made.  From  the  data  in  Sec.  7.2.2,  where  x 
=  30.232,  s  =  5.000,  and  U  =  35,  we  find  zR  =  33.56,  which,  being  less 
than  35,  does  not  call  for  rejection;  and  za  —  34.70,  which,  being  less 
than  35,  calls  for  acceptance. 

12.  NOTATION 

The  arrangement  is  alphabetical,  the  sequence  of  listing  under  each 
letter  being  as  follows:  first  Roman,  then  Greek  letters;  within  these 

*  Tables  of  a  and  b  in  terms  of  a  and  /3  are  given  in  Statistical  Research  Group 
Sequential  Analysis,  Tables  4.2,  5.1,  6.1,  and  7.31.  These  give  a  and  6  to  3  decima 
places  for  values  of  a  and  /3  of  0.001,  0.01  to  0.05  by  steps  of  0.01,  0.05  to  0.20  by  steps 
of  0.05,  and  0.20  to  0.40  by  steps  of  0.10. 

f  From  the  unpublished  tables  referred  to  in  the  footnote  at  the  beginning  of  Sec. 
11.6,  which  are  based  on  the  likelihood  ratio  computed  from  two  non-central  t  distribu- 
tions, the  corresponding  values  of  kA  and  kR  are  found  to  be  0.6630  and  0.8893.  For 
kR  =  0.6651  and  fc^  =  0.8930,  the  true  values  of  a  and  /3  are,  respectively,  0.0110  and 
0.0177,  which  differ  by  9.6  and  11.7  percent  from  the  intended  values,  0.01  and  0.02. 
Thus,  these  two  OC  points  lie  below  (to  the  left  of)  those  intended,  whereas  with 
single  sampling  the  true  points  generally  lie  above  (to  the  right  of)  those  intended 
(see  Table  1.8). 
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groups,  first  capital,  then  small  letters;  within  these  groups,  first  those 
without,  then  those  with,  primes,  bars,  or  carets ;  within  these  groups,  first 
those  without  subscripts,  next  those  with  literal  subscripts  (following 
the  same  sequence  as  for  letters),  and  finally  those  with  numerical 
subscripts  (in  numerical  order).  Certain  symbols,  notably  a  and  6,  are 
used  in  a  variety  of  different  meanings  that  occur  only  locally;  in  such 
cases  the  listing  is  in  order  of  first  appearance.  Section  11.4  employs  the 
notation  of  Johnson  and  Welch,  which  differs  from  (and  in  some  respects, 
for  example,  U  and  L,  contradicts)  that  used  elsewhere  in  the  chapter. 


Symbol 

See 

Meaning 

A 

p.  37 

An  adjustment  constant  subtracted  from 
each  observation 

A 

p.  61 

Critical  value  of  z  for  accepting  or  rejecting 
a  lot 
A  function  of  X,  N,  K\,  and  K2  that  appears 

A 

p.  84 

equations  (246) 

(247) 

in  formulas  for  the  sequential  procedure 

AQL 

pp.  11,  45 

Acceptable  quality  level  (see  pi) 

a 

pp.   19,  20 

Function  of  N  and  Ka  that  appears  in  the 

equations  (21), 

(22) 

formula  for  k  when  N  is  fixed 

a 

p.   61 

Function    of   K\,    K2,    Ka,    and   Kp   that 

equations  (136) 

(137) 

appears  in  the  formula  for  N 

a 

p    35 

Coefficient  of  quadratic  term  in  condition, 

equations  (55), 

(56) 

quadratic  in  X,  for  acceptance  in  terms 
of  Y 

a 

p.  58 

Multiple  of  a-  in  formula  for  mean  of  dis- 

equation (118) 

tribution  of  s 

a 

pp.  83,  84 

A  critical  value  of  X  in  a  sequential  test  on 

equation  (240) 

the  basis  of  which  the  lot  is  rejected  or 
another  observation  is  taken 

a 

p.   16 

Producer's  risk.  Probability  that  a  lot  of 
acceptable  quality  (p\  or  AQL)  will  be 
rejected 

B 

p.  84 

A  function  of  X,  N,  Kh  and  K2  that  appears 

equations  (246) 

(248) 

in  formulas  for  the  sequential  procedure 

b 

pp.  19,  20 

Function  of  N,  K\,  and  Ka  that  appears  in 

equations  (21), 

(23) 

the  formula  for  k  when  N  is  fixed 

b 

p.  35 

Negative  of  coefficient  of  linear  term  in 

equations  (55), 

(57) 

condition,  quadratic  in  X,  for  acceptance 

p.  58 

in  terms  of  Y 

b 

Multiple  of  a/\/2(N  —  1)  in  the  formula 

equation  (119) 

for  standard  deviation  of  distribution  of  s 

6 

p.  61 

Function  of  k  and  a  that  appears  in  the 

equations  (136) 

(138) 

formula  for  N 

b 

pp.  83,  84 

A  critical  value  of  X  in  a  sequential  test  on 

equation  (241) 

the  basis  of  which  the  lot  is  accepted  or 
another  observation  is  taken 
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Symbol 

See 

Meaning 

but 

pp.  53,  54 

Value  of  b  obtained  from  equation  (57)  by- 

equations  (100),  (103) 

replacing  U  by  M 

bu 

pp.  53,  54 

Same  as  b  given  by  equation  (57) ;  subscript 

equations  (100) 

,  (103) 

U  used  to  distinguish  bu  from  6m 

bf 

p.  41 

Negative  of  coefficient  of  linear  term  in 

equations  (78), 

(79) 

condition,  linear  in  X,  for  acceptance  in 
terms  of  Y 

P 

p.  16 

Consumer's  risk.  Probability  that  a  lot  of 
unacceptable  quality  (p2  or  LTPD)  will 
be  accepted 

c 

p.  35 

Constant  term  in  condition,  quadratic  in  X, 

equations  (55), 

(58) 

for  acceptance  in  terms  of  Y 

CM 

pp.  53,  54 

Value  of  c  obtained  from  equation  (58)  by 

equation  (104) 

replacing  U  by  M 

cu 

pp.  53,  54 

Same  as  c  given  by  equation  (58) ;  subscript 

equations  (100) 

(104) 

U  used  to  distinguish  cu  from  cm 

c' 

p.  41 

Constant  term  in  condition,  linear  in  X,  for 

equations  (78), 

(80) 

acceptance  in  terms  of  Y 

7 

p.  43 

A  proportion  (0  <  y  <  1)  denoting  a 
confidence  coefficient 

7c 

p.  44 

Confidence  coefficient  used  in  judging 
whether  to  continue  normal  inspection 

Tr 

p.  44 

Confidence  coefficient  used  for  judging 
whether  to  resume  normal  inspection 

5 

p.  73 

Johnson  and  Welch  notation  for  a  constant 
appearing  in  the  numerator  of  non-central 

t;  5  =  ^/NU 

*Xf,M 

p.  73 

Johnson  and  Welch  notation  for  the  value 

Sec.  11.4.4 

of  5  that  makes  P(f,S,to)  =  e 

€ 

p.  17 

A    probability     (0  <  e  <  1);    probability 

equation  (9) 

that  a  normal  deviate  will  exceed  Ke 

€ 

p.  73 

Johnson  and  Welch  notation  for  probability 
of  acceptance 

F(N) 

p.  61 

Any  function  of  N  that  is  independent  of  c 

f 

p.  73 

Degrees  of  freedom  of  %2 

h 

p.  42 

Number  of  lots  from  which  samples  are 
taken  and  combined  into  one  large  sample 
for  the  purpose  of  computing  the  process 
average 

K 

p.  9 

(U  —  fx)/<r.  That  is,  the  difference  be- 
tween the  upper  limit  and  the  population 
mean,  as  a  multiple  of  the  population 
standard  deviation 

Ka 

p.  17 

Normal  deviate  exceeded  with  prob- 
ability a 

Kp 

p.  17 

Normal  deviate  exceeded  with  prob- 
ability j8 

Ke 

p.  17 

Normal  deviate  exceeded  with  prob- 
ability € 
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Symbol 

See 

Meaning 

K0 

p. 

21 

Normal  deviate  exceeded  with  prob- 
ability p0 

K1 

p. 

17 

Normal  deviate  exceeded  with  prob- 
ability pi 

K2 

P- 

17 

Normal  deviate  exceeded  with  prob- 
ability p  2 

k 

P- 

12 

Coefficient  of  the  sample  standard  devia- 
tion in  the  acceptance  criterion 

x  +  ks  <  U; 
also,  acceptable  level  of  statistic  k0 

kA 

p. 

84 

Acceptance  level  for  k0  in  a  sequential  test 

k0 

P- 

43 

(U  —  x)/s.  That  is,  the  observed  differ- 
ence between  U  and  the  sample  mean  in 
units  of  the  sample  standard  deviation; 
sample  estimate  of  K 

kR 

P- 

84 

Rejection  level  for  k0  in  a  sequential  test 

k' 

P- 

48 

\/n  times  the  coefficient  of  the  sample 
standard  deviation  in  the  acceptance 
criterion,  £  +  (kr /  y/n)  s  <  U',  used  when 
items  are  eventually  assembled  in  sets  of  n 

Ky 

P- 

47 

Upper  confidence  limit  for  K,  based  on  a 
confidence  coefficient  of  y 

kyc,    kyc 

P- 

45 

Lower  and  upper  limits,  respectively, 
between  which  a  computed  value  of  k0 
leads  under  normal  inspection  to  con- 
tinuing normal  inspection,  based  on  a 
confidence  coefficient  of  yc 

kyr,   ty, 

P- 

45 

Lower  and  upper  limits,  respectively, 
between  which  a  computed  value  of  k0 
leads  under  reduced  or  tightened  inspec- 
tion to  resuming  normal  inspection,  based 
on  a  confidence  coefficient  yr 

L 

P- 

72 

Johnson  and  Welch  notation  for  the  upper 
limit  of  a  variable,  above  which  an  item  is 
defective;  corresponds  with  our  U 

LTPD 

PI 

.  22,  45 

Lot  tolerance  percent  defective  (see  p2) 

Lip 

P- 

15 

Ordinate  of  an  OC  curve  at  abscissa  p. 
The  probability  that  a  lot  of  fraction 
defective  p  will,  if  submitted  to  a  given 
acceptance  test,  be  passed 

Ln,<r 

P- 

16 

Value  of  an  OC  curve  at  /z,  <r.  The  prob- 
ability that  a  lot  of  mean  ^  and  standard 
deviation  cr  will,  if  submitted  to  a  given 
acceptance  test,  be  passed 

Li 

P- 

61 

Probability  that  a  lot  having  mean  Zi  will, 
if  submitted  to  a  given  acceptance  test, 
be  passed 
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Symbol 

See 

Meaning 

U 

P- 

62 

Probability  that  a  lot  having  mean  Z2  will, 
if  submitted  to  a  given  acceptance  test,  be 
passed 

X 

P- 

83 

Natural  logarithm  of  the  ratio  of  two 
probability  densities,  computed  after  each 
observation  in  a  sequential  test 

X  = 

X(f,«o,«) 

P- 

74 

Johnson  and  Welch  notation  for  a  function 

equations  (223),  (224) 

of  /,  £0,  and  e  used  in  the  calculation   of 

8(f,t0,e)  and  t(SM 

Xi 

P- 

82 

Value  of  X  computed  in  determining  pi_a 

x2 

P- 

82 

Value  of  X  computed  in  determining  pp 

M 

P- 

13 

Minimum  value  of  a  variable,  below  which 
an  item  is  defective 

M 

P- 

9 

Population  mean  of  a  normally  distributed 
variable 

P- 

41 

Best  estimate  of  x  available  before  the 
sample  is  inspected 

N 

P- 

12 

Number  of  items  in  a  sample 

Na 

P- 

26 

Sample  size  for  single-sampling  plan  for 
percent  defective  based  on  atributes 

Nl 

P- 

43 

Number  of  items  in  the  ith  sample  of  a 

(i  = 

1,  •  •  •  ,  h) 

group  of  h  samples 

Nm 

P- 

27 

Sample  size  for  single-sampling  plan  for 
percent  defective  based  on  the  mean  when 
the  standard  deviation  is  known 

Ns 

P- 

28 

Sample  size  for  single-sampling  plan  for 
percent  defective  based  on  the  standard 
deviation  when  the  mean  is  known 

Nt 

P- 

27 

Sample  size  for  single-sampling  plan  for 
percent  defective  based  on  the  mean  and 
standard  deviation  when  both  are  un- 
known but  estimated  from  sample 

N' 

P- 

48 

Number  of  sets  of  n  items  in  a  sample 

n 

P- 

48 

Number  of  items  constituting  a  set  which 
is  judged  defective  or  not  on  the  basis  of 
the  average  for  the  set 

n 

P- 

50 

Number  of  variables  upon  which  the 
quality  of  an  item  depends 

OC 

Sec.  3.4 

Operating  characteristics.     The  probability 

Sec.  6 

that  a  given  acceptance  test  will  pass  lots 

of  various  qualities  if  submitted 

PA 

P- 

42 

Process  average  percent  defective 

P(f,*,U) 

P. 

73 

Johnson  and  Welch  notation  for  the  prob- 

ability that   t   exceeds   a   given   level   U 

for  given  /  and  8 

P(z\Zi) 

P- 

83 

Probability  density  of  z  for  lot  in  which  the 

equation  (234) 

mean  of  z  is  Zi 

P{z\ 

Z2) 

P- 

83 

Probability  density  of  z  for  lot  in  which  the 

equation  (235) 

mean  of  z  is  Z2 
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Symbol 

See 

Meaning 

V 

p.   15 

Fraction  defective  (quality)  of  a  lot 

pl-a 

p.  82 

The  quality  for  which  the  probability  of 
acceptance  is  1  —  a  with  a  given  sampling 
plan 

PP 

p.  82 

The  quality  for  which  the  probability  of 
acceptance  is  /3  with  a  given  sampling  plan 

Po 

pp.  20, 

54 

Indifference  quality,  or  lot  fraction  defec- 
tive for  which  acceptance  and  rejection 
are  equally  probable 

pi 

p.   16 

Lot  fraction  defective  for  which  it  is  desir- 
able that  the  probability  of  rejection  be 
at  most  a;  (see  AQL) 

Pi 

p.   16 

Lot  fraction  defective  for  which  it  is  desir- 
able that  the  probability  of  acceptance  be 
at  most  p;  (see  LTPD) 

p' 

pp.  30, 

33 

Lot  fraction  defective  for  which  the  prob- 
ability of  acceptance  (Lp')  is  1  —  Lp 

1       It 

Po,  Po 

pp.  55, 

64 

Two  parts  (p'0  <  p")  into  which  p0  is 
arbitrarily  divided,  that  is,  p'0  +  p"  =  p0 

p 

p.  43 

Point  estimate,  corresponding  with  a  con- 
fidence coefficient  of  zero,  of  the  process 
average  percent  defective.  Probability 
that  a  normal  deviate  will  exceed  k0 

f)y,     Py 

p.  43 

Lower  and  upper  confidence  limits,  respec- 
tively, for  fraction  defective  where  y  is  the 
confidence  coefficient 

s 

pp.  30, 

32 

\/l/N  +  k*/[2(.N  -  1)].  Quantity  used  in 
computations  for  OC  curve 

s 

p.   12 

Sample  estimate  of  the  standard  deviation 

equatio 

ns  (5),  (6) 

of  a  variable 

S{ 

p.  43 

Sample  estimate  of  the  standard  deviation 

{i    =    1,   •    • •    ,   h) 

of  a  variable  for  the  ith.  sample  of  a  group 
of  h  samples 

-S2 

p.   12 

Sample  estimate  of  the  variance  (squared 

equations  (5),  (6) 

standard  deviation)  of  a  variable 

2 

p.   12 

Summation  over  a  sample.     The  variable 

equations  (4),  (5) 

following    2  is  to  be  summed  over  the 

sample 

<x 

p.  9 

Population  standard  deviation  of  a  nor- 
mally distributed  variable 

a(N) 

p.  61 

Population  standard  deviation  of  the  nor- 

equation (139) 

mally  distributed  variable  z,  a  function  of 

the  sample  size  N 

p.   72 

A  variable  having  the  non-central  t  distri- 
bution; in  this  chapter,  used  to  desig- 
nate the  variable  y/Nu  =  (\/N/s)(L  —  x) 

t{f,8,e) 

p.  73 

Johnson    and    Welch    notation    for    that 

Sec.  11.4.5 

value    of    t0    for  which   the   probability 

P(f,8,t0)  takes  a  specified  value  e 
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Symbol 

See 

Meaning 

^0 

p.  73 

Johnson  and  Welch  notation  for  a  given 
level  of  non-central  t 

fa,    t2> 

pp.  75,  76 

Successive  approximations  obtained  in  the 

iterative  method  for  calculating  t(f,8,e) 

u 

p.  9 

Upper  limit  of  a  variable,  above  which  an 
item  is  defective 

u 

p.  72 

Johnson  and  Welch  notation  for  standard- 
ized normal  deviate  exceeded  with  prob- 
ability p;  corresponds  with  our  K 

Ui 

p.  76 

Johnson  and  Welch  notation  for  the  normal 
deviate  exceeded  with  probability  pi 

U' 

p.  48 

Upper  limit  of  the  mean  of  a  set  of  items, 
above  which  the  set  is  defective 

uy 

p.  47 

Tolerance  limit  on  U  for  a  confidence  coeffi- 

equation 

(91) 

cient  y.  Minimum  estimate  of  U  sufficient 
to  give  a  confidence  y  that  the  fraction 
defective  will  not  exceed  a  fixed  pi 

u 

p.  72 

Johnson  and  Welch  notation  for  sample 
statistic  (L  —  x)/s,  that  is,  the  observed 
difference  between  L  and  the  sample  mean 
in  units  of  the  sample  standard  deviation : 
corresponds  with  our  k0 

Wo 

p.  72 

Johnson  and  Welch  notation  for  critical 
value  of  statistic  u;  corresponds  with  our  k 

w 

p.  73 

Johnson  and  Welch  notation  for  quantity 
in  denominator  of  noncentral  t  denoting  a 
variable  distributed,  independently  of  z, 
as  x2//  with  /  degrees  of  freedom;  cor- 
responds with  our  (s/<r)2 

X 

p.  34 

Xx,  the  sum  of  the  observations 

X 

p.  67 

Value    of    X    at    which    minimum    value 

equation  (183) 

YR  of  YR  occurs 

Xi  or  x 

p.   12 

A  sample  observation.     A  measurement  on 

(f  =  1,  •'  •  •  , .N) 

a  variable  quality  characteristic 

X 

p.   12 

Sample  mean  of  a  variable  x 

equation 

(4) 

Xi 

p.  43 

Sample  mean  of  a  variable  x  for  the  ith 

a  =  i,  •  •  •  ,  k) 

sample  of  a  group  of  h  samples 

x' 

p.  56 

Abscissa    of   the    point   having   the    same 

equation 

(113) 

ordinate  as  the  point  (x,s)  on  the  bound- 
ary of  the  acceptance  region  for  a  two- 
sided  criterion 

X2 

p.  73 

A  variable  having  the  chi-square  distribu- 
tion 

Y 

p.  34 

Xx2,  the  sum  of  the  squares  of  the  observa- 
tions 
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Symbol 

See 

Meaning 

YE 

p.  36 

Boundary  between   error  and  acceptance 

equation  (70) 

regions   when   values    of    X   are    plotted 

horizontally  and  values  of   Y  vertically; 

minimum  value  of  Y  that  can  be  obtained 

for  any  given  value  of  X 

YR 

p.  36 

Boundary  between  rejection  and  accept- 
ance regions  when  values  of  X  are  plotted 
horizontally  and  values  of  Y  vertically; 
parabola  defined  by  equation  (55),  using 
the  equality  sign 

Y' 

p.  37 

Sum  of  squares  of  adjusted  observations, 

equation 

(71) 

that  is,  observations  from  which  a 
constant  has  been  subtracted 

Y'E 

p.   37 

Boundary  between  the  error  and  accept- 

equation (74) 

ance  regions  when  values  of  X  are  plotted 

horizontally  and  values  of  Y'  vertically 

Y'R 

p.   37 

Boundary     between     the     rejection     and 

equation 

(73) 

acceptance  regions  when  values  of  X  are 
plotted  horizontally  and  values  of  Y' 
vertically 

YR 

p.   67 

Minimum  value  of  the  curve  Yr  ;  a  value  of 

equation 

(184) 

Y  below  which  the  lot  is  acceptable  with- 
out regard  to  X 

1  R 

p.   69 

Minimum  value  of  the  curve  YR ;  a  value  of 

equation  (195) 

Y'    below    which    the    lot    is    acceptable 

without  regard  to  X 

y 

p.   74 

Johnson  and  Welch  notation  for  a  function 

equation 

(225) 

of  t0  and  /,  one  of  the  arguments  against 
which  X  is  tabled 

y' 

p.   74 

Johnson  and  Welch  notation  for  a  function 

equation 

(226) 

of  to  and  /,  one  of  the  arguments  against 
which  X  is  tabled 

z 

p.   61 

Population  mean  of  the  normally  distri- 
buted variable  z 

Z: 

p.   61 

Value  for  Z  for  which  it  is  required  that  the 
probability  of  rejection  be  a 

z2 

p.   61 

Value  of  Z  for  which  it  is  required  that  the 
probability  of  acceptance  be  0 

z 

p.  61 

A  normally  distributed  variable 

z 

p.   73 

Johnson  and  Welch  notation  for  a  variable 
distributed  normally  about  zero  with  unit 
standard  deviation 

ZA 

p.  85 

x  +  kAs 

Zr 

p.  85 

x  +  kRs 
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CHAPTER  2 
TOLERANCE  LIMITS  FOR  NORMAL  DISTRIBUTIONS 

1.  NATURE  OF  TOLERANCE  LIMITS 

1.1.  Definition  of  Tolerance  Limits 

A  common  method  of  specifying  the  quality  of  manufactured  product 
is  to  set  a  range,  the  bounds  of  which  are  called  " tolerance  limits,"  such 
that  a  certain  percentage  of  the  product  made  under  commercial  condi- 
tions may.  be  expected  to  have  a  quality  falling  between  the  limits.  For 
example,  tolerance  limits  may  be  placed  on  dimensions  to  indicate  the 
extent  of  interchangeability  of  parts.  Tolerance  limits  are  also  used 
in  other  fields,  for  example  medicine  and  psychology,  to  indicate  the 
range  of  normal  variation  in  various  characteristics.  In  general,  toler- 
ance limits  are  limits  within  which  lie  a  certain  percentage  of  the  indi- 
viduals of  a  statistical  universe. 

The  satisfactory  application  of  tolerance  limits  requires  that  assign- 
able causes  of  variability  be  detected  and  eliminated  so  that  the  remaining 
variability  may  be  considered  random.  In  other  words,  prediction  from 
past  data  of  the  proportion  of  future  observations  that  will  be  contained 
within  fixed  limits  assumes  that  the  successive  values  of  the  quality 
characteristic  in  question  may  be  considered  randomly  chosen  from  a 
statistical  universe.* 

1.2.  Class  of  Tolerance  Limits  Considered 

The  tolerance  limits  considered  in  this  chapter  are  symmetrical  about 
the  mean,  and  they  are  based  on  the  assumption  that  the  underlying 
statistical  universe  may  be  described  by  a  normal  distribution.  When 
the  true  mean  and  standard  deviation  n  and  a  of  this  normal  distribution 
are  known,  tolerance  limits  are  formed  by  adding  to  and  subtracting  from 
the  mean  a  certain  multiple  K  of  the  standard  deviation ;  the  appropriate 
multiple  is  easily  obtained  from  tables  of  the  normal  distribution  and 
depends  only  upon  what  proportion  of  the  universe  is  to  be  included 
within  the  limits,  f     Values  of  K  corresponding  to  various  proportions 

*  A  detailed  discussion  of  the  nature,  use,  and  practical  basis  for  tolerance  limits 
may  be  found  in  Shewhart,  Statistical  Method,  Chap.  II,  pp.  50-79.  In  fact,  this  is 
the  original  source  on  which  subsequent  work  on  tolerance  limits  has  been  based. 
(For  full  citations,  see  Sec.  4.) 

f  See  Chap.  1,  Sec.  2,  for  further  discussion  of  this  point. 
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included  within  the  tolerance  range  may  be  found  by  entering  Table  1.1 
of  Chap.  1  with  e  equal  to  half  the  proportion  excluded.  For  example,  90 
percent  of  the  population  is  included  in  the  interval  \i  —  1.645o-  to 
[i  +  1.645<r,  since  90  percent  inclusion  corresponds  with  10  percent 
exclusion,  ore  =  0.05. 

In  most  practical  situations  the  true  values  m  and  u  are  not  known, 
and  tolerance  limits  must  be  estimated  from  the  mean  X  and  the  standard 
deviation  s  of  a  sample  of  N  observations.  While  the  limits  m  ±  1.645o- 
will  include  90  percent  of  the  universe,  clearly  the  same  assertion  cannot 
be  made  of  the  limits  X  ±  1.645s.  The  proportion  included  between  the 
limits  X  ±  Ks  depends,  for  example,  on  whether  the  sample  mean  is 
unusually  large,  reasonably  close  to  the  population  mean,  or  unusually 
small.  The  sample  standard  deviation  is  also  subject  to  chance  fluctua- 
tions that  affect  the  proportion  included  between  the  limits.  In  fact,  if  a 
value  of  K  is  fixed,  no  two  samples  from  a  given  universe  will  lead  to 
the  same  tolerance  limits,  since  both  X  and  s  will  vary  from  sample  to 
sample. 

It  is  clear  then  that  the  proportion  of  the  universe  included  between 
the  limits  X±  Ks  is  a  random  variable,  and  that  it  is  impossible  to 
determine  K  so  that  the  limits  will  always  contain  a  specified  proportion 
P  of  the  universe.  It  is  possible,  however,  to  determine  K  in  such  a  way 
that  in  a  large  series  of  samples  from  normal  universes  a  certain  propor- 
tion 7  of  the  intervals  X  ±  Ks  will  include  P  or  more  of  the  universe; 
7  is  called  "the  confidence  coefficient/'  since  it  is  a  measure  of  the  confi- 
dence with  which  we  may  assert  that  a  given  tolerance  range  includes  at 
least  P  of  the  universe. 

Intuitively,  it  seems  reasonable  that  the  values  of  K  for  use  with 
sample  means  and  standard  deviations  will  be  somewhat  larger  than  the 
values  used  if  the  true  mean  and  standard  deviation  are  known.  In 
fact,  by  taking  K  very  large  the  probability  that  the  interval  X  ±  Ks 
will  contain  at  least  P  of  the  universe  may  be  made  as  close  to  1  as  is 
desired.  Since  the  length  of  the  tolerance  interval  is  2Ks,  the  smaller  K 
is,  the  more  meaningful  and  the  less  trivial  a  tolerance  range  is.  The 
position  is,  therefore,  one  of  being  able  to  state  broad  generalities  with 
little  risk  of  error,  or  to  make  more  precise  statements  at  greater  risk. 
The  problem  is  to  find  the  smallest  factor  K  that  may  be  associated  with  a 
specified  confidence  coefficient  y,  proportion  P,  and  sample  size  N. 
Table  2. 1  presents  such  factors  for  various  combinations  of  7,  P,  and  N. 
Before  discussing  the  use  of  Table  2.1,  it  is  desirable  to  contrast  the  tol- 
erance limits  it  yields  with  other  kinds  of  limits. 

1.3.  Other  Kinds  of  Limits 

1.3.1.  One-sided  Tolerance  Limits. — Instead  of  specifying  a  two- 
sided  interval  that  will  contain  a  certain  percentage  of  the  product,  it 
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will  frequently  be  appropriate  to  specify  a  one-sided  limit  such  that  a 
fixed  percentage  of  the  product  will  have  a  quality  greater  than  (or  a 
quality  less  than,  as  the  case  may  be)  this  limit.  Problems  of  this  sort 
may  be  solved  by  the  methods  of  Chap.  1,  Sec.  9.2. 

1.3.2.  Multivariate  Tolerance  Limits. — Certain  methods  of  setting 
tolerance  limits  have  been  generalized  by  Wald  and  by  Paulson  to  the 
problem  of  setting  simultaneous  limits  on  several  quality  characteristics 
of  a  single  item.     In  this  connection,  see  also  Chap.  3. 

1.3.3.  Tolerance  Limits  for  Non-normal  Distributions. — Wald  has 
considered  the  problem  of  setting  tolerance  limits  on  the  basis  of  large 
samples  for  distributions  of  any  known  form,  normal  or  otherwise. 
Application  of  these  tolerance  limits  to  normal  distributions  gives  unsatis- 
factory results;  their  applicability  (as  contrasted  with  their  theoretical 
correctness  when  N  approaches  infinity)  to  other  forms  of  distribution 
has  not  been  determined. 

1.3.4.  Distribution-free  Tolerance  Limits. — Sometimes  it  is  desirable 
to  set  tolerance  limits  without  making  any  assumption  about  the  form, 
normal  or  otherwise,  of  the  underlying  distribution.  Problems  of  this 
sort  have  been  discussed  by  Wilks  and  by  Wald.  The  problem  of  deciding 
whether  to  use  tolerance  limits  based  on  the  normal  distribution,  those 
based  on  some  other  distribution,  or  those  which  involve  no  assumption 
about  the  form  of  the  distribution  is  one  which,  though  of  a  kind  common 
in  applied  statistics,  has  no  satisfactory  solution.  Tolerance  limits  based 
on  the  normal  distribution  are  substantially  shorter  for  a  fixed  sample 
size  than  those  based  on  no  assumption  about  the  distribution;  but  they 
may  be  irrelevant  if  the  underlying  distribution  is  too  far  from  normal. 
The  decision  about  the  adequacy  of  the  normality  assumption  may  be 
based  upon  an  analysis  of  measurements  of  a  large  number  of  items;  it 
may  be  supported  by  theoretical  considerations,  for  example,  when  the 
quality  characteristic  is  known  to  be  determined  by  many  chance  causes 
whose  effects  are  essentially  additive;  or  it  may  rest  on  a  combination  of 
theoretical  and  empirical  evidence.  It  should  be  noted  that  if  there  is  a 
stable  universe  from  which  the  measurements  are  random  observations, 
limits  of  the  form  X  ±  Ks  will,  in  a  fraction  y  of  samples,  include  a 
certain  proportion  P'  or  more  of  the  universe,  where  P'  differs  from  the 
intended  P  by  an  amount  depending  upon  the  departure  of  the  true 
distribution  from  normality.* 

IS  1.3.5.  Confidence  Limits. — Tolerance  limits  should  not  be  confused 
with  confidence  limits  (or  with  fiducial  limits)  for  the  mean  or  other  param- 
eters of  the  distribution.  Confidence  limits  for  the  mean,  which  also 
are  of  the  form  X  ±  Ks,  are  computed  in  such  a  way  that  in  a  fraction 
7  of  the  samples  from  which  they  are  computed  they  will  include  the  mean 


*  For  a  fuller  and  more  precise  statement  of  this  point,  see  Chap.  1,  Sec.  9.4, 
including  the  quotation  from  Jennett  and  Welch. 
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of  the  distribution.  Tolerance  limits,  in  contrast,  are  computed  in  such  a 
way  that  in  a  fraction  7  of  the  samples  from  which  they  are  computed  they 
will  include  at  least  a  fraction  P  of  the  items  in  the  distribution.  Thus, 
as  the  sample  size  increases,  the  value  of  K  for  a  confidence  interval 
diminishes  toward  zero,  whereas  the  value  of  K  for  a  tolerance  interval 
diminishes  toward  the  normal  deviate  including  a  proportion  P  of  the 
universe. 

2.  TABLE  OF  FACTORS  FOR  TOLERANCE  LIMITS 

2.1.  Description 

Table  2.1  shows  3,880  values  of  the  multiple  K  of  the  sample  standard 
deviation  s  to  be  added  to  and  subtracted  from  the  sample  mean  X,  for 
various  values  of  the  confidence  coefficient  7,  the  proportion  P  of  the 
distribution  to  be  included  within  the  tolerance  limits  and  the  sample 
size  N.  Four  values  of  7  are  given:  0.75,  0.90,  0.95,  and  0.99;  five  values 
of  P:  0.75,  0.90,  0.95,  0.99,  and  0.999;  and  194  values  of  N:  from  2  to 
102  by  steps  of  1,  from  102  to  180  by  steps  of  2,  from  180  to  300  by  steps 
of  5,  from  300  to  400  by  steps  of  10,  from  400  to  750  by  steps  of  25,  from 
750  to  1,000  by  steps  of  50,  and  infinity. 

2.2.  Use  of  the  Table  :  Example 

In  connection  with  fire-control  problems  for  a  certain  projectile  it  is 
desirable  to  know,  with  a  confidence  coefficient  of  0.99,  limits  within  which 
the  muzzle  velocities  of  90  percent  of  the  future  rounds  will  lie.  The 
required  calculations  are  performed  on  the  observed  velocities  as  follows : 
First,  add  all  the  observations,  calling  the  sum  2X.  Then  add  all  the 
squares  of  the  observations,  calling  the  sum  2X2.  These  two  sums  and  N, 
the  number  of  observations,  are  the  only  information  needed  from  the 
sample;  in  the  present  example  they  are 

2X  =  291,200  ft.  per  sec.  (1) 

2X2  =  393,114,400  (2) 

N  =  216  (3) 

The  mean  and  standard  deviation  are  computed  as  follows : 

X  =  =±  =  1,348.15  ft.  per  sec.  (4) 
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N(N  _  ij        =  49.82  ft.  per  sec.  (5) 

Entering  Table  2.1  with  P  =  0.90  and  7  =  0.99,  we  find  for  N  =  215, 
K  =  1.856,  and  for  N  =  220,  K  =  1.854;  by  linear  interpolation  we  find 
that  for  N  =  216,  K  =  1.856.*     The  required  tolerance  interval,  in  feet 

*  Linear  interpolation  with  respect  to  N  will  not  result  in  an  error  of  more  than 
1  in  the  third  decimal  place. 
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per  second,  is  then  1,348.15  ±  1.856  X  49.82;  so  we  may  assert  with  99 
percent  confidence  that  90  per  cent  of  all  the  future  observations  will  lie 
between  1,255.7  and  1,440.6  ft.  per  sec. 

2.3.  Cases  outside  the  Range  of  the  Table 

2.3.1.  Formula  for  Large  Values  of  N. — For  values  of  N  beyond  the 
range  of  the  table,  the  following  formula  may  be  used:* 

K  =  kJ\  +  -%=  +  5X*  +  10)  (6) 

\         y/2N  12N     )  K  J 

where  K^  (the  value  of  K  we  would  use  if  the  true  fi  and  a  were  known)  is 
denned  by 


■        '         e    2dt  =P  (7) 


and  xy  is  denned  by 


V2 


1       /•-■_*! 


Values  of  K„  are  listed  in  Table  2. 1  under  the  appropriate  P  columns  and 
are  the  same  for  all  y;  they  may  also  be  determined  easily  from  Table  1.1 
in  Chap.  1.  Values  of  xy  may  be  found  from  any  table  of  the  areas  of 
the  normal  curve,  such  as  Table  1.1.  For  the  values  of  y  used  in  Table 
2.1,  however,  Table  2.2  gives  the  values  of  the  functions  of  x  required 
within  the  parentheses  of  formula  (6). 

Unless  N  is  too  small,  formula  (6)  also  gives  results  of  satisfactory 
accuracy  for  values  of  P  and  y  not  included  in  Table  2.1.  When  N 
>  160,  y  <  0.99,  and  P  <  0.999,  formula  (6)  gives  values  of  K  agreeing 
within  1  unit  in  the  third  significant  figure  with  those  in  Table  2. 1 . 

2.3.2.  Example. — In  studying  another  characteristic  of  the  projectile 
referred  to  in  Sec.  2.2,  the  following  sample  values  were  obtained : 

N  =  1,449  (9) 

X  =  670.40  (10) 

s  =  37.45  (11) 

Again  assuming  y  =  0.99  and  P  =  0.90,  we  have,  from  Table  2.2 
and  formula  (6), 

K  =  1.645  (l  +  i^E  +  ^883\  =  L72Q  (12) 

\         v  1449         1449  / 

The  upper  and  lower  tolerance  limits  are,  therefore,  734.81  and  605.99, 
respectively. 

*  This  formula  was  developed  in  the  Statistical  Research  Group  by  Albert  H. 
Bowker;  see  his  paper  cited  in  Sec.  4. 
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TABLE  2.2 
Coefficients  to  Facilitate  the  Use  of  Formula  (6)  for  Computing  K  When  N  Is 

Large 


Coefficient 

Value 

y  =  0.75 

y  =  0.90 

y  =  0.95 

7  =  0.99 

Term  in 

V2 

5Xy2     +     10 

12 

0.4769 
1.0229 

0 . 9062 
1.5177 

1.1631 
1 . 9606 

1 . 6450 
3 . 0883 

Term  in  -n 

N 

3.  CONSTRUCTION  OF  THE  TABLE 

3.1.  Formula 

The  formula  for  K  used  in  Table  2.1  is* 


K 


S. 


(13) 


where  N  is  the  number  of  observations,  P  is  the  proportion  of  the  distri- 
bution to  be  included  within  the  tolerance  limits,  y  is  the  confidence 
coefficient,  r  is  a  function  of  1/y/N  and  P  defined  by 


y/tor]  _i__r 


and  Xy  is  defined  by 


2  dt  =  P  (14) 

Vn 

P(X2  >  X\)    =  7  (15) 

in  terms  of  the  chi-square  distribution  for  n  degrees  of  freedom,  where 

n  =  N  -  1.  (16) 

3.2.  Basic  Network  of  Points 

K  was  computed  directly  from  formula  (13)  for  all  values  of  n  (that 
is,  of  N  —  1)  from  1  to  30,  for  all  even  values  of  n  from  30  to  60,  for 
values  of  n  from  60  to  200  at  intervals  of  10,  and  for  values  of  n  from  199 
to  999  at  intervals  of  50.  For  other  values  of  n,  values  of  K  were  inter- 
polated into  this  basic  network  of  points  by  means  of  4-point  Lagran- 
gian  interpolatory  coefficients  taken  from  the  table  published  by  the  New 
York  Mathematical  Tables  Project. 

3.3.  Percentage  Points  of  x2 

For  all  values  of  n  from  1  to  30  and  for  values  from  30  to  100  at 
intervals  of  10,  the  percentage  points  of  x2  were  obtained  from  Thomp- 

*  This  formula  was  developed  in  the  Statistical  Research  Group  by  A.  Wald  and 
J.  Wolfowitz;  see  their  paper  cited  in  Sec.  4. 
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son's  table.  Since  for  even  n  the  probability  of  exceeding  any  given  value 
X2  is  given  by  the  partial  sum  of  a  Poisson  series,  the  required  values 
between  n  =  30  and  n  =  60  (except  n  =  40  and  n  =  50)  were  obtained 
by  inverse  interpolation  in  Molina's  table*  using  Salzer's  inverse  inter- 
polation coefficients.  The  percentage  points  of  x2for  n  >  100  were  found 
from  a  Cornish-Fisher  type  of  expansion  applied  to  the  x2  distribution.! 

3.4.  Computation  of  r 

To  compute  r,  Newton's  method  was  applied  to  the  function 

G{r)  =  ~^=      VN     e    2  tit  -  P  =  0  (17) 

V  2tt7  _i__r 

Vn 

The  NYMTP  tables  of  the  probability  integral  are  conveniently 
arranged  for  this,  since  the  ordinates  of  the  normal  curve  are  tabulated 
beside  the  integral,  and  forward  interpolation  in  both  the  integral  and 
the  ordinate  may  be  expressed  simply  in  terms  of  the  ordinate. 
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CHAPTER  3 

MULTIVARIATE  QUALITY  CONTROL,  ILLUSTRATED 
BY  THE  AIR  TESTING  OF  SAMPLE  BOMBSIGHTS 

1.  INTRODUCTION 
1.1.  Type  of  Problem 

Systematic  plans  of  sampling  for  acceptance  inspection  or  quality 
control  are  for  the  most  part  based  on  simple  features  of  the  individual 
articles  inspected.  Usually  each  article  is  classified  merely  as  defective 
or  nondefective.  In  a  minority  of  cases  there  is  a  finer  classification 
involving  measurement  of  continuous  variates.  In  the  inspection  of 
simple  articles,  including  component  parts  of  complicated  assemblies, 
inspection  by  attributes  is  considered  appropriate;  but  in  the  inspection 
of  complicated  assemblies,  especially  when  inspection  is  costly,  measure- 
ments are  apt  to  be  advantageous.*  Furthermore,  the  performance  of 
complicated  assemblies  is  usually  a  function  of  several  different  variables. 
When  several  quality  characteristics  are  measured,  there  arises  a  problem 
of  combining  the  several  measurements  on  each  article  into  a  single 
measure  (" score"  or  " index")  of  the  over-all  performance  of  the  article. f 

In  the  absence  of  a  formally  defined  procedure,  methods  of  a  more  or 
less  subjective  character  for  combining  the  different  measurements  to 
form  a  judgment  of  degree  of  acceptability  are  bound  to  develop.  The 
simplest  such  method  is  to  apply  customary  practices  to  each  variate 
separately  and  to  reject  whenever  the  usual  rules  applied  to  any  one 
variate  alone  lead  to  a  verdict  of  rejection.  This  procedure,  however, 
has  the  effect  of  changing  drastically  the  probabilities,  both  of  accepting 
what  is  bad  and  of  rejecting  what  is  good.  For  example,  if  a  sampling 
scheme  is  designed  to  give  a  probability  of  0.05  of  rejecting  a  lot  of  a 
certain  quality  on  the  basis  of  measurements  of  one  variate,  and  if  this 
scheme  is  applied  to  each  of  two  variates  measured  on  the  same  articles, 
then  if  the  two  variates  are  independently  distributed,  rejection  on  the 
basis  of  one  or  the  other  variate  will  take  place  in  9.75  per  cent  of  sub- 
mitted lots  of  the  given  quality  instead  of  5  per  cent  as  originally  con- 
templated. The  discrepancy  between  the  actual  and  the  intended 
probabilities  of  rejection  will  increase  rapidly  with  the  number  of  inde- 
pendent variates  when  such  a  rule  is  followed. 

*  For  a  further  discussion  of  this  point,  see  Chap.  1,  Sec.  1. 
f  For  a  further  discussion  of  this  point,  see  Chap.  1,  Sec.  10.1. 
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An  alternative  rule,  erring  in  the  opposite  direction,  would  be  to 
accept  any  lot  for  which  the  univariate  scheme  applied  to  any  one  of 
the  variates  calls  for  acceptance.  Thus  rejection  would  require  that  all 
the  variates  take  such  values  as  to  call  for  rejection.  For  two  inde- 
pendent variates,  a  probability  of  rejection  intended  to  be  0.90  would 
actually  in  such  a  case  be  only  0.81. 

These  probabilities  can  be  adjusted  so  as  to  become  equal  to  0.05, 
or  such  level  as  is  chosen,  by  altering  the  acceptance  level  for  each  variate 
separately.  However,  this  brings  in  additional  difficulties.  The  variates 
may  not  be  mutually  independent,  and  then  calculations  such  as  the  fore- 
going must  be  altered  to  take  into  account  the  multivariate  distribution. 
Furthermore,  it  will  often  not  be  known  whether  they  are  independent  or 
not;  or  if  they  are  mutually  dependent,  the  character  of  the  dependence 
may  be  known  only  imperfectly.  Thus  the  correlation  coefficient  may 
have  to  be  estimated  from  a  preliminary  sample  of  size  so  small  as  to 
leave  its  exact  value  somewhat  uncertain.  Any  acceptance  probabilities 
based  on  such  a  correlation  coefficient  will  then  likewise  be  uncertain. 

Rules  such  as  those  suggested  above  have  another  defect.  An  article 
close  to  the  margin  of  acceptability  with  respect  to  one  variate  may  well 
be  marked  for  acceptance  or  rejection  on  the  basis  of  the  other  variates 
involved.  Unusual  excellence  in  one  respect  may  often  atone  for  a  slight 
departure  in  another  way  from  what  would  otherwise  be  considered 
satisfactory.  A  formula  is  then  needed  for  combining  the  measures  in 
the  several  dimensions  into  a  single  measure  of  excellence,  or  of  adherence 
to  specifications  or  to  a  desired  ideally  uniform  set  of  dimensions.  Such 
a  formula  should  give  a  measure  of  departure  from  the  ideal  that  will 
increase  with  a  departure  in  any  dimension.  It  should  lend  itself  to  an 
exact  interpretation  in  terms  of  probability,  taking  account  of  the  neces- 
sarily limited  size  of  any  preliminary  sample  used  to  determine  the 
parameters  entering  into  the  formula. 

Formulas  of  this  type  are  here  developed  on  the  basis  of  an  expression 
known  as  the  generalized  Student  ratio  T,  introduced  in  1931  for  the 
testing  of  hypotheses  about  the  location  of  the  means  of  multivariate 
distributions  and  closely  related  to  the  exponent  in  the  multivariate 
normal  distribution.  *  If  errors  in  two  or  more  dimensions  were  mutually 
uncorrelated  and  of  equal  variance  in  the  sample  representing  past 
experience,  the  value  of  T  for  a  new  individual  would  be  directly  pro- 
portional to  the  distance  from  the  point  whose  rectangular  coordinates 
represent  the  ideal  to  the  point  representing  the  actual  measurements. 
Thus,  for  a  bomb  dropped  on  a  target,  T  would  in  such  a  case  be  simply 

*  See  Hotelling,  "The  Generalization  of  Student's  Ratio."  (For  full  citations, 
see  Sec.  6.) 
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the  distance  of  the  bomb  from  the  center  of  the  target,  expressed  in  terms 
of  a  unit  equal  to  the  standard  deviation  found  in  the  earlier  experience. 
Actually,  the  variances  of  range  errors  in  bombing  are  appreciably  less 
than  those  of  deflection  errors.*  Consequently,  the  expression  used  for 
T2  has  different  coefficients  for  the  two  squared  terms.  In  addition,  it 
has  a  term  in  the  product  of  range  and  deflection  errors,  the  small  coef- 
ficient of  which  results  from  a  slight  correlation  between  the  two  kinds  of 
error  in  the  available  experience.  This  T2  serves  as  a  measure  of  the 
deviation  of  the  particular  bomb  from  the  center  of  the  target,  a  measure 
more  accurately  interpretable  in  terms  of  probability  than  is  the  actual 
distance.  By  adding  the  values  of  T2  for  all  the  bombs  dropped  with  a 
particular  bombsight,  we  have  a  measure  useful  in  judging  the  accuracy  of 
that  bombsight.  There  are  also  other  ways  of  combining  these  T"s  to 
throw  light  on  particular  features. 

A  great  advantage  of  all  these  T  statistics  is  that,  in  samples  from 
normal  distributions,  they  have  probability  distributions  that  can  be 
determined  exactly.  Many  statistics  have  distributions  involving  what 
may  be  called  " nuisance  parameters,"  that  is,  parameters  which  are  not 
known  exactly  from  general  considerations  or  basic  scientific  constants 
but  have  to  be  estimated,  with  more  or  less  uncertainty,  from  fallible 
observations.  This  is  true,  for  example,  of  the  actual  distance  of  a 
bomb  from  a  target,  whose  distribution  depends  not  only  on  the  mean 
value  of  such  distances  but  also,  and  in  a  complicated  way,  on  the  direction 
of  the  deviation.  The  parameters  specifying  the  distribution  of  the 
direction — which  is  really  irrelevant — are  nuisance  parameters  in  the 
distribution  of  the  actual  distance,  confusing  its  interpretation  as  a 
measure  of  accuracy  of  the  bomb.  But  the  T  statistics  have  been  con- 
structed in  such  a  way  that  there  are  no  nuisance  parameters  involved  in 
their  distributions,  at  least  as  long  as  we  are  dealing  with  data  from  a 
fundamental  distribution  of  the  normal  form. 

It  is  to  be  expected  that  these  T  statistics  will  have  many  applica- 
tions in  sampling  inspection  and  in  other  fields.  In  particular,  wherever 
a  manufactured  article  is  to  be  judged,  not  on  the  basis  of  measurement  of 
parts,  but  on  the  basis  of  final  performance  as  measured  by  several  vari- 
ables, there  seems  to  be  room  for  the  utilization  of  such  methods.  A  case 
of  this  kind,  which  is  here  considered  in  detail  and  for  which  the  methods 
presented  here  were  originated,  is  that  of  air  testing  sample  bombsights. 
It  is  presented  here  as  a  precedent  that  will  be  useful  in  other  situations 
in  which  final  performance  is  determined  by  multiple  measurements. 

*  An  error  in  range  is  an  error  in  the  direction  of  the  airplane's  heading  at  the 
time  of  release.  An  error  in  deflection  is  an  error  in  a  direction  perpendicular  to  the 
plane's  heading. 
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The  mathematical  theory  is  not  given  in  full  in  this  chapter,  but  the 
methods  recommended  will,  it  is  believed,  be  fully  intelligible  on  the  basis 
of  the  material  presented  here.* 

Besides  the  T  distribution,  several  other  innovations  are  here  sug- 
gested, both  graphic  devices  and  methods  of  applying  to  statistical 
quality  control  some  of  the  ingenious  devices  and  concepts  that  R.  A. 
Fisher  has  introduced  into  the  design  of  experiments. 

1.2.  Problems  of  Air  Testing  Bombsights 

The  dropping  of  bombs  from  airplanes  for  the  purpose  of  testing 
bombsights  submitted  by  manufacturers  is  a  phase  of  quality  control. 
The  art  and  the  literature  of  quality  control,  so  greatly  expanded  of  late, 
are  to  a  certain  extent  applicable  to  this  bombing  procedure.  Yet  test 
bombing  presents  several  novel  features  that  make  the  situation  very 
different  from  those  heretofore  dealt  with  in  the  literature  and  usual  prac- 
tice of  quality  control.  These  peculiar  features,  which  make  this  no 
ordinary  problem  and  render  the  usual  methods  inadequate  and  to  some 
extent  inappropriate,  are  the  following: 

(1)  In  contrast  to  the  simple  articles  to  which  routine  mass  sampling 
inspection  is  usually  applied,  with  at  most  one  or  two  numbers  recorded 
for  each  article  tested,  and  often  merely  a  "good"  or  "bad"  verdict,  the 
bombsight  is  a  highly  complex  instrument  with  a  multiplicity  of  parts, 
each  of  which  is  subject  to  various  possible  defects. 

(2)  The  process  of  putting  together  these  parts  in  selective  assembly, 
matching  an  oversize  part  with  an  undersize  part  to  fit  in  with  it,  grind- 
ing and  adjusting  when  necessary,  is  a  series  of  jobs  requiring  great  skill, 
whose  results  are  to  be  judged  by  the  performance  of  the  finished  instru- 
ment rather  than  by  measurement  of  parts.  Performance,  however,  is  a 
complicated  thing.  If  quality  is  to  be  measured  by  performance  and 
expressed  by  one  or  a  few  numbers,  caution  must  be  exercised  in  the 
choice  of  these  numbers  and  their  interpretation. 

(3)  Air  testing  is  only  one  of  a  series  of  tests  and  inspections  to  which 
a  bombsight  and  its  parts  are  subjected.  It  is  the  final  step,  and  an 
exceptionally  costly  one.  For  elimination  of  defects  the  main  reliance 
is  properly  placed  on  earlier  and  cheaper  steps.  A  great  deal  of  careful 
screening  out  of  defectives  has  taken  place  prior  to  the  final  stage  of  air 
testing,  with  which  alone  this  chapter  is  concerned. 

(4)  The  errors  of  test  bombing  are  not  by  any  means  due  entirely  to 
defects  in  the  sight.  In  spite  of  all  attempts  at  standardization  there  are 
bound  to  be  variations  in  the  weather,  the  altitude  from  which  the  bombs 
are  dropped,  the  bombs  themselves,  the  airplane,  the  auxiliary  equip- 

*  It  is  planned  to  present  the  mathematical  theory  in  the  Annals  of  Mathematical 
Statistics. 


Sec.  1.2  MULTIVARIATE  QUALITY  CONTROL  117 

ment,  and  the  personnel  engaged  in  flying,  bombing,  and  observing  the 
results.  The  variance  in  the  results  of  the  bombing  that  results  from 
causes  having  nothing  to  do  with  the  quality  of  the  bombsight  indicates 
that,  while  valuable  statistical  inferences  are  possible  from  large  numbers 
of  cases,  it  would  be  a  dubious  procedure  to  take  any  drastic  action  on  the 
basis  of  the  four  or  six  bombs  usually  dropped  with  a  sight,  except  perhaps 
in  the  most  extreme  cases. 

(5)  When  bombing  with  a  particular  sight  shows  unsatisfactory 
results,  the  action  ordinarily  taken  is  not  the  rejection  of  the  sight,  but 
its  adjustment  or  repair. 

(6)  Whereas  the  usual  sampling  inspection  for  quality  control  is 
carried  out  very  cheaply  for  each  single  inspection  by  a  workman  of 
moderate  skill  using  some  simple  instrument,  the  dropping  of  a  test  bomb 
requires  a  pilot,  a  bombardier,  an  airplane  with  gasoline  and  maintenance, 
an  airfield,  a  target  area,  stations  for  observing  the  point  of  impact  and 
personnel  to  man  them,  and  finally,  plotting  and  computations. 

The  function  of  air  testing  is  clearly  not  the  elimination  of  individual 
bad  sights  or  of  bad  lots  of  10  from  which  the  sights  tested  are  taken. 
This  is  done  far  more  efficiently  by  inspection  of  parts  and  by  shop  testing 
of  the  finished  bombsight,  both  by  the  manufacturer  and  by  the  govern- 
ment representatives  at  the  factory,  as  well  as  by  the  shop  testing  of  the 
sample  sent  to  the  government  proving  ground.  The  principal  functions 
which  air  testing  is  capable  of  fulfilling  appear  rather  to  be  the  following: 

(1)  Keeping  a  running  record  of  quality  designed  to  compare  the 
work  of  different  productive  units,  and  of  the  same  unit  at  different  times. 

(2)  Detection  of  irregularities,  trends  in  quality,  and  deviations  from 
customary  levels  in  the  average  performance  of  sights  produced  over  a 
period. 

(3)  Exploration  of  possible  types  of  defects,  either  in  design  or  in 
production,  not  previously  recognized. 

(4)  By  experiments  carefully  designed  so  as  not  to  conflict  with  the 
other  purposes  of  air  testing,  to  obtain  judgments  on  differing  models 
of  sights,  on  auxiliary  equipment,  on  bombing  methods,  and  on  personnel 
in  a  manner  more  efficient  than  separate  experiments  carried  out  solety  to 
obtain  these  judgments. 

The  high  cost  and  uncertainty  of  air  testing,  with  the  relative  accuracy 
and  low  cost  of  shop  testing,  indicate  that  only  a  very  small  number  of 
bombsights  should  be  air-tested.  However,  it  does  appear  that  some  air 
testing  should  go  on,  for  reasons  outside  those  usual  in  sampling  inspec- 
tion and  to  an  extent  that  cannot  be  determined  by  calculation  according 
to  customary  methods.  In  the  specific,  though  flexible,  plan  suggested 
in  Sec.  2,  the  amount  of  routine  bombing  called  for  is  one-twentieth 
of  that  under  procedures  in  effect  in  1943      The  exact  proportion  suitable 
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cannot  be  derived  by  calculation  from  the  proportion  of  defects,  which 
could  not  be  ascertained  reliably  from  the  record  because  of  the  virtual 
absence  of  defects  serious  enough  to  cause  rejection.  The  fraction  sug- 
gested represents  what  is  in  the  author's  view  a  reasonable  proportion  of 
bombsights  to  be  tested  if  the  improved  statistical  methods  described 
herein  are  utilized.  The  plan  can  easily  be  modified,  in  accordance  with 
Sec.  3,  to  fit  any  other  proportion  considered  appropriate. 

In  much  quality-control  work  where  individual  inspection  is  cheap, 
inefficient  statistical  formulas  are  used  frankly  and  deliberately  when  they 
are  easy  to  calculate  and  to  comprehend.  The  saving  in  computation 
represented  by  using  the  mean  deviation  instead  of  the  standard  devia- 
tion with  many  small  batches  may  in  some  circumstances  offset  the 
increase  of  approximately  one-eighth  in  the  amount  of  inspection  neces- 
sary to  get  the  same  accuracy.  The  range  of  a  sample  (that  is,  the  differ- 
ence between  the  greatest  and  least  of  the  values)  is  often  taken  as  a 
measure  of  dispersion  when  it  is  felt  that  the  inability  of  factory  employes 
to  understand  any  more  sophisticated  measure  warrants  the  cost  of  the 
additional  observations  required  to  obtain  the  necessary  accuracy. 

Considerations  of  this  kind  should  have  no  place  where  the  unit  cost 
is  as  great  as  that  of  dropping  a  bomb.  Information  acquired  at  such 
great  expense  should  not  be  wasted  by  the  use  of  inefficient  statistical 
methods  of  combining  the  observations.  One  object  of  this  chapter  is  to 
set  forth  the  most  appropriate  methods  of  combining  the  observations  of 
the  bomb  coordinates  and  of  making  inferences  from  them.  Some  modi- 
fications of  the  usual  kinds  of  control  charts  are  introduced.  Further- 
more, the  contribution  of  modern  statistical  methods  to  the  improvement 
of  accuracy  is  not  limited  to  formulas  for  combining  the  observations  after 
they  are  made.  Certain  precautions,  such  as  proper  randomization  and 
replication,  can  be  arranged  only  before  the  observations  are  made.  The 
application  of  these  principles  to  air-testing  sample  bombsights  by  a  plan 
in  accordance  with  modern  principles  of  experimental  design  is  another 
leading  purpose  of  this  chapter. 

Section  2  consists  of  a  specific  plan  for  the  selection  of  sample  bomb- 
sights,  their  air  testing,  the  calculations  to  be  made  from  the  results,  a 
series  of  control  charts  of  new  types,  and  some  simple  considerations 
relevant  to  interpretation  of  the  results  and  to  action. 

Since  Sec.  2  is  easily  adaptable  to  changed  conditions  and  modifica- 
tions of  the  plan,  formulas  are  given  in  Sec.  3  for  calculating  new  control 
limits  and  other  needed  quantities  under  conditions  more  general  than 
those  of  Sec.  2.  Provision  is  here  made  for  changing  the  number  of  bombs 
dropped  per  flight,  the  number  of  flights  with  each  sight,  the  number  of 
sights  to  be  taken  from  a  lot  for  air  testing,  and  the  base  period  with  which 
comparison  is  to  be  made. 
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While  Sees.  2  and  3  consist  principally  of  simple  rules  for  operating 
and  for  amending  the  sampling  plan,  with  little  mathematics  or  argu- 
mentation, Sec.  4  gives  the  background  and  reasons  for  the  plan.  The 
chief  mathematical  discussion  has,  however,  been  confined  to  a  technical 
paper,  where  the  mathematical  reader  will  find  a  more  precise  account 
of  some  of  the  new  ideas  presented  here  for  the  first  time.  These  ideas 
are  applicable  to  a  wide  range  of  problems,  not  only  in  quality  control  but 
also  in  connection  with  bombing  and  firing  in  general,  with  mental  test- 
ing, and  Avith  other  problems  in  which  two  or  more  normally  distributed 
variates  are  involved. 

Section  5  is  an  analysis  of  the  results  of  dropping  260  ore-filled  bombs. 
Measures  of  dispersion  obtained  from  these  bombs  are  substituted  into 
the  formulas  of  Sec.  3  to  obtain  the  control  limits  specified  in  the  plan  of 
Sec.  2,  together  with  coefficients  for  combining  range  and  deflection 
errors  and  other  needed  quantities.  For  any  sampling  plan,  a  base 
period  is  needed  to  get  the  plan  started,  and  this  is  provided  by  the 
material  analyzed  in  Sec.  5.  This  section  also  contains  as  by-products  a 
specimen  rating  of  equipment  and  personnel  involved  in  that  bombing, 
essentially  to  show  how  such  ratings  can  be  made,  and  a  new  simplified 
technique  of  analysis  of  variance  with  unequal  class  frequencies  developed 
for  the  purpose  of  making  these  ratings. 

This  chapter  includes  control  charts  having  several  novel  features, 
which  it  is  believed  will  also  be  found  useful  in  other  control-chart  work. 
One  is  the  choice  of  the  functions  combining  different  measures  of 
accuracy,  here  range  and  deflection  errors,  to  be  plotted.  These  func- 
tions are  invariant  under  transformations  of  coordinates  and  hence  have 
probability  distributions  capable  of  exact  determination.  Another 
feature  of  the  control  charts  here  presented  is  the  provision  of  yardsticks, 
for  visual  comparison  of  different  plotted  values,  with  a  definite  prob- 
ability interpretation. 

In  fixing  the  control  limits  corresponding  to  specified  probabilities, 
account  is  here  taken  of  the  limited  number  and  consequent  sampling 
fluctuations  of  the  observations  in  the  base  period  used  for  comparison. 
It  is  to  be  feared  that  in  many  control  charts  heretofore  in  use  there  has 
been  an  unconscious  distortion  of  the  probabilities  because  of  neglect  of 
this  consideration,  with  a  bias  uniformly  in  the  direction  of  exaggerating 
the  significance  of  moderately  large  deviations  from  norms.  Use  of 
formulas  such  as  those  of  Sec.  3  in  industrial  quality  control  should  cor- 
rect this  error. 

2.  PROPOSED  SAMPLING  PLAN 

2.1.  Selection  of  Sample  Sights  and  Arrangement  of  Bombing 

(1)  Whenever  a  new  factory  goes  into  production,  each  of  the  first 
40  bombsights  produced  and  passing  the  usual  shop  tests  of  the  govern- 


120  TECHNIQUES  OF  STATISTICAL  ANALYSIS  Chap.  3 

ment  inspectors,  both  at  the  factory  and  at  the  proving  ground,  is  to  be 
tested.  If  all  these  are  found  satisfactory,  the  remainder  of  the  plan  is 
to  be  applied  to  subsequent  production,  subject  to  Avithdrawal  with 
reversion  to  the  status  of  a  new  factory  if  later  a  series  of  unacceptable 
sights  is  offered.  (Since  the  last  contingency  has  not  hitherto  occurred, 
accurate  formulation  of  a  definite  rule  for  such  reversion  must  await  the 
accumulation  of  this  undesirable  kind  of  experience.) 

(2)  At  each  factory  that  has  been  approved  after  the  first  40  bomb- 
sights,  the  sights  of  each  model  are  to  be  numbered  consecutively  in  the 
order  of  manufacture  in  each  of  two  series.  One  series,  designated  A, 
shall  consist  of  those  sights  approved  without  unusual  delay  by  the 
government  inspector  at  the  factory  on  first  delivery  by  the  manufacturer. 
The  other  series,  designated  B,  shall  consist  of  those  sights  first  rejected 
by  the  government  inspector  and  later  approved  after  defects  have  been 
remedied,  together  with  any  other  approved  sights  whose  approval  has 
for  any  reason  been  delayed.  To  each  sight  are  to  be  affixed  both  the 
usual  serial  number  representing  original  order  of  manufacture  and  the 
letter  A  or  B,  with  the  number  within  the  A  or  B  series.  The  subse- 
quent procedures  are  to  be  applied  separately  to  the  A  and  B  series  for 
each  factory  and  for  each  model. 

(3)  Each  series  is  to  be  divided  into  successive  lots  of  20,  strictly  in 
the  order  of  original  manufacture. 

(4)  From  each  lot  of  20,  two  sights  are  to  be  selected  by  an  objective 
chance  procedure  and  sent  to  the  proving  ground.  No  element  of  per- 
sonal judgment  or  of  rating  on  inspection  should  enter  into  this  selection, 
which  may  be  made  in  the  following  manner :  the  procurement  authorities 
will  send  to  the  chief  inspector  at  the  factory  a  series  of  opaque  sealed 
envelopes  marked  on  the  outside  with  the  first  and  last  serial  numbers  of 
the  successive  batches.  An  envelope  is  to  be  opened  only  after  the 
inspection  at  the  factory  of  all  the  sights  in  a  batch  has  been  completed 
and  report  made.  Within  the  envelope  will  be  found  two  numbers 
between  1  and  20,  inclusive,  prepared  on  a  strictly  chance  basis,  with 
equal  probabilities  for  all  twenty  numbers.  This  must  be  done  through 
actual  utilization  of  a  chance  mechanism  or  of  a  table  of  random  num- 
bers, *  without  human  discretion  other  than  in  the  selection  of  the  mech- 
anism. One  obvious  procedure  would  be  to  mark  twenty  playing  cards 
with  the  numbers  1  to  20,  and  after  thoroughly  shuffling  to  draw  out  two 
of  them  at  random. 

(5)  The  remaining  18  sights  of  each  lot  are  not  to  be  held  at  the 
factory  until  completion  of  the  air  testing  of  the  sample  but  should  be 

*  For  example,  Fisher  and  Yates,  Statistical  Tables,  Table  XXXIII ;  Ken- 
dall and  Smith,  Random  Sampling  Numbers;  Tippett,  Random  Sampling 
Numbers. 
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sent  directly  to  service,  subject  to  later  recall  or  repair  in  the  unlikely 
event  that  the  sample  is  found  seriously  defective. 

(6)  Upon  arrival  at  the  proving  ground,  the  sample  sights  are  all  to 
be  shop-tested  and  adjusted  as  at  present,  with  such  further  types  of  shop 
test  as  may  be  developed. 

(7)  By  an  objective  random  procedure  like  that  described  in  Para- 
graph (4)  above,  lots  are  to  be  selected  for  the  air  testing  of  the  two  sights 
representing  the  lot  at  the  proving  ground.  It  is  suggested  that  one- 
fortieth  of  the  lots  produced  by  each  manufacturer  be  selected  by  chance. 
(It  is  likely  that  this  proportion  should  be  reduced  as  the  improved 
methods  of  shop  testing  prove  their  value.) 

(8)  Each  of  the  two  sights  representing  each  of  the  lots  selected  in 
accordance  with  Paragraph  (7)  is  to  be  taken  up  in  an  airplane  twice  to 
be  used  for  dropping  bombs. 

(9)  The  routine  flight  testing  is  to  be  arranged  in  periods,  each  con- 
sisting of  four  consecutive  days,  or  four  days  as  nearly  consecutive  as 
possible  even  though  there  may  be  breaks  on  account  of  Sundays  or 
special  circumstances.  The  number  of  test  flights  to  be  made  per  day 
should  first  be  determined  and  will  here  be  denoted  by  k.  This  same 
number  of  flights  k  should  be  made  on  each  day  of  the  4-day  period, 
though  it  is  not  essential  that  k  should  be  the  same  for  different  periods. 

(10)  The  sights  to  be  tested  in  a  4-day  period  are  to  be  the  two  sample 
sights  from  each  of  k  lots.  Each  sight  is  to  be  tested  on  two  different 
days  of  the  period,  and  each  lot  is  to  be  represented  on  all  four  days. 
Subject  to  these  conditions,  the  assignment  of  the  particular  sight  of  a 
lot  to  be  tested  on  a  specific  day  is  to  be  decided  by  objective  chance,  for 
example,  by  flipping  a  coin.  The  k  lots  represented  may  be  from  the 
same  or  different  factories.  They  may  be  of  different  models  and  may 
be  of  A  or  B  series  or  both. 

(11)  Exactly  four  bombs  are  to  be  dropped  on  each  flight.  Range 
and  deflection  errors  are  to  be  measured  with  respect  to  the  heading  of  the 
airplane  at  the  time  the  particular  bomb  was  dropped. 

(12)  The  numerical  values  for  control  limits  and  yardsticks  given  in 
this  section  are  to  be  used  only  if  the  directions  in  the  preceding  eleven 
paragraphs  are  carried  out  exactly.  If  it  is  decided  to  revise  the  plan 
by  dropping  a  different  number  of  bombs  on  each  flight,  or  by  measuring 
range  and  deflection  with  respect  to  the  true  course  instead  of  the  heading, 
the  numerical  values  will  need  to  be  amended  in  accordance  with  the 
formulas  of  Sec.  3.  The  number  of  bombs  dropped  per  flight  will  be 
denoted  by  N,  with  the  understanding  in  this  section  that  N  =  4.  No 
change  should  be  made  except  as  part  of  an  explicit  revision  of  the  plan. 

(13)  All  measures  of  quality  based  on  the  bombing  results  should  be 
algebraic  functions  of  the  coordinates  of  the  bombs  with  respect  to  the 
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target,  taken  with  their  proper  algebraic  signs.  Absolute  values  and 
mean  deviations  are  not  algebraic  functions  and  should  not  be  employed 
in  this  connection.  Use  of  criteria  based  on  mean  deviations  requires 
more  bombing  to  procure  comparable  accuracy  than  the  formulas  used 
herein.  Furthermore,  in  contrast  to  the  T  measures  here  recommended, 
no  combination  of  mean  deviations  has  an  exactly  known  distribution. 

» 

2.2.  Control  Charts 

2.2.1.  General  Plan. — Seven  control  charts,  illustrated  by  Figs.  3.1  to 
3.7,*  are  to  be  kept  for  each  factory  and  model  and  for  the  A  and  B  series. 
On  these  charts  certain  indicators  of  quality  of  successive  lots  are  plotted, 
strictly  in  the  order  of  production.  Entries  on  these  charts  should  be 
made  daily  as  soon  as  the  data  become  available.  The  charts  should  be 
posted  conspicuously  at  the  proving  ground  where  they  will  be  seen  by  all 
concerned  with  the  testing.  Copies  of  them  should  be  sent  weekly  to 
the  procurement  authorities,  to  the  particular  manufacturer  concerned, 
and  to  the  government  inspector  at  the  factory.  The  charts  should  all 
be  drawn  with  the  same  horizontal  scale,  representing  the  order  of  the  lot 
represented,  and  should  be  placed  one  above  another  with  a  clear  indica- 
tion of  the  boundaries  between  them  when  this  is  practicable.  Space 
should  be  left  at  the  bottom  for  comments  on  exceptional  cases  and  a 
notation  of  action  taken  with  respect  to  such  cases. 

The  quantities  represented  on  the  charts,  from  top  to  bottom  in 
order,  will  be  designated  by  T0,  TD,  TM,  sr,  f,  sd,  d.  The  highest  of  these, 
To,  is  the  most  comprehensive  measure  of  the  bombing  errors  associated 
with  the  sights  under  consideration,  and  anyone  who  has  time  to  consider 
only  one  measure  should  watch  T0.  The  next  two  break  down  the  over- 
all error  into  a  part  representing  deviations  from  the  mean  point  of  impact 
(MPI)  of  the  bombs  on  a  flight  and  a  part  representing  deviations  of  the 
MPI  from  the  target.  All  these  first  three  measures  are  combinations 
of  range  errors  with  deflection  errors.  Of  the  remaining  four  quantities 
charted,  two  refer  to  range  errors  alone  and  two  to  deflection  errors  alone, 
two  to  the  MPI  and  two  to  deviations  from  the  MPI. 

Any  general  tendency  to  slipshodness  and  poor  work  in  a  factory  is 

*  These  illustrative  charts  were  made  up  from  the  actual  bombing  data  discussed 
in  Sec.  5  and  are  thus  mutually  consistent.  Though  these  records  of  nights  were  all 
derived  from  different  sights,  they  were  arbitrarily  grouped  into  consecutive  pairs, 
representing  fictitious  single  sights,  and  these  pairs  were  again  grouped  into  consecu- 
tive pairs  representing  samples  from  hypothetical  lots,  just  as  if  the  plan  here  pre- 
sented had  been  in  effect.  The  last  flight  was  omitted  so  as  to  leave  64  flights,  a 
multiple  of  4,  as  is  required  by  this  plan.  The  result  is  to  give  a  reasonably  good 
illustration  of  the  kind  of  charts  that  can  be  expected  in  test  bombing  of  the  kind 
carried  out  with  bombsights  of  the  same  general  quality  as  the  ones  then  tested. 
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more  likely  to  be  observed  in  rising  curves  in  the  upper  charts,  approach- 
ing and  passing  the  control  limits,  whereas  light  on  specific  faults  is  to  be 
sought  by  casting  the  eye  downward  gradually  to  the  lower  charts. 

The  seven  quantities  to  be  charted  will  be  defined  in  the  following 
paragraphs,  beginning  with  the  simplest.  They  will  therefore  be  in  a 
different  order  from  that  above.  The  actual  ordinates  are  not  propor- 
tional to  the  quantities  mentioned  above,  except  f  and  d,  but  to  their 
logarithms.  Consequently,  arithlog  (that  is,  semilogarithmic)  paper 
should  be  used  for  the  top  five  charts  to  save  the  trouble  of  looking  up 
logarithms.  Also,  the  vertical  scales  on  these  five  top  charts  are  to  be 
marked  on  the  left  with  numerical  values  equal  to  the  squares  of  the  T"s 
and  s's  since,  as  will  be  seen  later,  this  avoids  the  labor  of  extracting 
square  roots.  The  T's  and  s's  themselves  are  given  on  vertical  scales  at 
the  right  of  these  charts. 

We  shall  use  r  and  d  as  symbols  for  range  and  deflection  errors, 
respectively,  in  mils,  *  and  indicate  by  S  the  sum  over  the  N  ( = 4)  bombs 
in  a  flight  of  the  values  of  the  expression  following  S. 

All  the  numerical  coefficients,  yardstick  lengths,  and  control  limits 
herein  specified  should  be  revised  periodically  as  further  experience 
accumulates;  they  will  especially  need  to  be  revised  after  any  change  in 
technique  has  been  in  effect  long  enough  to  provide  a  sufficient  basis  in 
experience.     Formulas  for  revision  will  be  found  in  Sec.  3. 

2.2.2.  Control  Chart  for  Mean  Range. — The  control  chart  for  f,  illus- 
trated by  Fig.  3.1,  represents  fF  =  Sr/N  for  each  flight  by  a  cross  or  a 
circle.  For  each  lot  the  values  of  fF  found  from  the  two  flights  of  the 
sight  produced  earlier  than  the  other  are  represented  by  circles,  while 
those  for  the  sight  produced  later  are  represented  by  crosses.  The  mean 
of  these  four  values  of  fF  is  fL,  the  mean  of  the  range  errors  for  the  lot. 
The  points  representing  fL  for  successive  lots  are  connected  by  straight 
lines. 

Control  limits  are  shown  as  two  pairs  of  horizontal  lines.  The  prob- 
ability that  a  lot  of  sights  constituting  a  random  sample  from  the  same 
population  as  the  sights  tested  in  the  base  period,  with  the  same  distribu- 
tion of  errors  of  all  kinds  as  in  the  test  period,  should  be  represented  by 
a  point  outside  the  outer  control  limits  is  0.01.  The  probability  that  such 
a  lot  should  yield  a  value  of  fL  falling  outside  the  inner  control  limits 
under  the  same  conditions  is  0.05.  The  upper  and  lower  limits,  corre- 
sponding to  each  probability  are  at  equal  distances,  not  from  the  zero 
line,  but  from  a  line  representing  the  range  error  in  the  MPI  of  all  the 
bombs  dropped  in  the  base  period.  This  point  is  discussed  in  Sec.  3.2, 
where  the  formulas  and  numerical  values  for  the  control  limits  are  also 

*  A  mil  is  an  angular  measure  approximately  equal  to  1/6,400  of  360  degrees  or 
to  1/1,000  of  a  radian,  the  exact  definition  varying  from  one  usage  to  another. 
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given.     The  0.01  limits  are  represented  by  solid  lines  at  5.89  and  —3.20 
mils;  the  0.05  limits  are  shown  by  lines  of  dashes  at  4.80  and  —2.11  mils. 

If  the  zigzag  line  should  get  outside  the  band  marked  out  by  the  inner 
control  limits,  this  fact  may  be  taken  as  a  warning  to  be  alert  for  any 
further  indication  of  faults  tending  to  make  bombs  fall  consistently  too 
far  short  of  or  too  far  beyond  the  target,  as  the  case  may  be.  If  the  outer 
control  limits  should  be  surpassed,  the  warning  is  more  emphatic,  since 
this  will  happen  only  once  for  100  lots  if  there  are  only  the  same  sources  of 
range  error  as  in  the  base  period.  An  instance  of  exceeding  the  outer 
control  limits  should  be  made  the  occasion  of  an  investigation  at  the 
proving  ground  to  determine  the  cause.  A  definite  trend  upward  or 
downward  in  the  plotted  quantity  is  another  indication  of  a  dangerous 
condition,  even  before  the  inner  control  limits  are  reached.  In  such 
contingencies,  unless  the  trend  or  the  excessive  value  of  fL  is  definitely 
attributed  to  faults  of  the  airplane,  personnel,  or  other  cause  independent 
of  the  quality  of  the  sight,  the  manufacturer  should  be  warned  promptly. 
It  is  particularly  important  to  give  early  warning  of  trends  in  order  that 
there  may  be  an  opportunity  to  correct  gradually  developing  faults  before 
they  become  serious.  Action  to  be  taken  when  control  limits  are  passed 
is  specified  further  in  Sec.  2.3. 

The  control  limits  refer  to  fL  but  serve  a  secondary  purpose  also  with 
respect  to  fs,  the  mean  of  the  range  errors  for  a  sight,  represented  by  a 
point  that  is  ordinarily  not  marked  but  that  can  be  estimated  visually 
from  a  pair  of  crosses  or  of  circles.  The  control  limit  having  probability 
0.01  of  being  exceeded  by  fL  has  a  probability  approximating  0.20  of  being 
exceeded  by  fs,  and  that  having  probability  0.05  for  fL  has  probability 
practically  0.33  for  fs,  provided  the  number  of  bombs  dropped  in  the  base 
period  is  large,  say  200  or  more.  Thus  the  fluctuations  of  range  error  in 
MPI  can  be  followed  visually  with  a  fair  idea  of  the  relevant  probabilities. 

To  facilitate  further  the  continuing  comparison  of  means  of  range 
errors,  a  yardstick  is  provided  at  the  left  for  the  difference  between  the 
means  of  two  lots,  a  longer  yardstick  for  the  difference  between  the  means 
of  two  sights,  and  a  still  longer  yardstick  for  the  difference  between  the 
means  of  two  flights.  The  vertical  distance  between  points  representing 
means  of  any  of  these  types  should  exceed  the  length  of  the  corresponding 
yardstick  about  once  in  20  times  if  chance  alone  is  operating.  The 
means  for  the  sights  are  not  shown  but  are  easily  inferred  as  the  centers 
of  gravity  of  the  two  points  representing  the  flights  with  the  individual 
sight.  A  visual  appraisal  of  the  distance  between  these  centers  of  gravity 
for  the  two  sights  representing  a  lot,  in  comparison  with  the  middle  yard- 
stick, will  usually  indicate  clearly  whether  the  difference  in  performance 
in  this  respect  is  unusually  great.  An  excessive  difference  should  be 
regarded  as  a  tentative  indication  of  some  variation  of  conditions  at  the 
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factory  during  the  production  of  the  lot  represented  by  this  sample. 
However,  it  might  also  be  explained  by  a  difference  in  weather  or  per- 
sonnel conditions  between  the  two  days  on  which  one  sight  was  tested 
and  the  two  days  on  which  the  other  sight  was  tested;  or  by  chance  alone 
for  every  twenty  such  comparisons  if  these  were  to  be  made  for  all  pairs. 
The  yardstick  lengths  on  the  f  chart  are  9.47,  6.70,  and  4.74  mils  for 
flights,  sights,  and  lots,  respectively. 

A  fallacy  that  must  be  guarded  against  in  connection  with  the  yard- 
sticks is  a  tendency  to  select  the  greatest  and  the  least  of  a  series  of  obser- 
vations and  to  regard  them  as  significantly  different  if  their  heights 
differ  by  more  than  the  length  of  the  yardstick.  This  is  wrong  and 
likely  to  lead  to  far  too  many  judgments  of  significant  difference,  because 
the  yardstick  lengths  are  determined  on  the  assumption  that  they  will 
be  used  for  comparing  only  observations  selected  in  a  manner  independ- 
ent of  their  magnitudes.  It  is  perfectly  legitimate  to  use  the  yardstick 
to  compare  consecutive  observations,  or  the  first  and  last  of  a  consider- 
able series,  though  even  so  it  must  be  remembered  that  significance  will 
emerge  fallaciously  about  once  in  20  times.  The  detection  of  trends 
in  quality  should  be  facilitated  by  applying  the  yardsticks  to  observa- 
tions a  considerable  distance  apart.  Discontinuous  changes  may  become 
evident  when  two  consecutive  observations  differ  by  an  amount  approxi- 
mating the  length  of  the  yardstick,  provided  earlier  observations  are 
fairly  well  in  accord  with  the  first  of  the  pair  and  later  ones  with  the 
second.  It  is  possible  to  devise  exact  statistical  criteria  for  both  trends 
and  discontinuities,  but  workers  in  quality  control  stress  the  value  of 
graphic  devices. 

2.2.3.  Control  Chart  for  Mean  Deflection. — A  control  chart  for  d 
(Fig.  3.2)  should  be  kept  in  exactly  the  same  way  as  that  for  f.  How- 
ever, the  control  limits  will  be  farther  apart  in  the  case  of  the  d  chart 
and  the  yardsticks  longer  because  the  variance  of  deflection  errors  is 
greater  than  that  of  range  errors.  The  0.01  control  limits  are  at  8.40 
and  —2.79  mils;  the  0.05  limits  are  at  7.05  and  —1.44  mils.  The  yard- 
stick lengths  are  11.65,  8.24,  and  5.83  mils. 

2.2.4.  Control  Chart  for  Range  Variance. — For  each  flight  the  esti- 
mated variance  of  range  errors  is  defined  by  the  formula 


s; 


8(r  -  f)'- 


N  -  1 
but  may  be  found  with  a  slight  economy  of  labor  by  using  the  fact  that 

S(r  -  f)2  =  Sr2  -  fFSr 

The  control  chart  for  range  variance  is  to  be  on  logarithmically  ruled 
paper  (Fig.  3.3).     Values  of  s2.  for  the  four  flights  representing  each  lot 
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of  bombsights  are  plotted  on  the  same  vertical  line,  two  circles  for  that 
one  of  the  sample  sights  of  the  lot  produced  earlier,  and  two  crosses  for 
the  other.  The  arithmetic  mean  of  these  four  values  of  s*  is  the  best 
estimate  of  range  variance  for  this  lot  of  bombsights.  The  points  repre- 
senting such  means  (which,  because  of  the  logarithmic  scale,  are  slightly 
above  the  centers  of  gravity  of  the  points  representing  individual  values 
of  sj)  are  connected  by  a  zigzag  line.  Only  a  single  control  limit  is 
plotted  in  this  case  for  each  probability  considered,  since  it  is  not  neces- 
sary to  be  on  guard  against  unusually  small  variances.  When  N  =  4, 
the  variance  estimate  for  a  lot  has  12  degrees  of  freedom  (3  for  each  of 
the  4  flights).  The  lot  variance  for  range  that  will  be  exceeded  by 
chance  once  in  20  times  is,  according  to  the  estimate  supplied  by 
our  preliminary  sample,  82.87,  and  this  is  represented  by  the  height  of 
the  control  limit  marked  0.05  in  Fig.  3.3.  The  corresponding  value  for 
probability  0.01  is  104.4  (see  Sec.  3.3.1  for  the  origin  of  these  values). 

It  should  particularly  be  observed  that  the  denominator  of  each  of 
the  estimates  of  variance  here  used  equals  the  number  of  degrees  of 
freedom,  not  the  sample  number.  Using  the  sample  number,  as  is 
often  done,  introduces  unnecessary  complications  into  the  determination 
of  the  probabilities,  and  if  applied  in  the  present  case  is  likely  to  lead  to 
large  errors. 

Yardsticks  having  functions  similar  to  those  of  Figs.  3.1  and  3.2  are 
provided  at  the  left  of  Fig.  3.3.  The  logarithmic  vertical  scale  of  Fig. 
3.3  makes  possible  the  use  of  these  yardsticks,  which  are  to  be  regarded 
as  subject  to  rigid  translation.  The  exact  comparison  of  estimates  of 
variance  in  terms  of  probability  must  be  made  by  means  of  a  function 
of  their  ratio,  or  the  difference  of  their  logarithms.  This  is  the  reason 
for  using  a  logarithmic  chart  for  s%  In  the  charts  for  f  and  d  an  arith- 
metic scale  was  used,  since  the  comparison  between  two  means  must 
be  made  in  terms  of  their  difference,  not  their  ratio.  The  lengths  of 
the  yardsticks  for  flights,  sights,  and  lots  correspond,  respectively,  to 
the  variance  ratios  15.45,  5.82,  and  3.28. 

To  illustrate  the  use  of  these  yardsticks,  suppose  that  s2r  should 
have  the  value  10  for  one  lot  and  40  for  the  next.  Both  points  are  well 
within  the  control  limits.  But  the  ratio  4  exceeds  the  yardstick  3.28, 
as  is  immediately  obvious  when  the  points  are  charted,  to  such  an  extent 
that  the  probability  is  considerably  less  than  0.05  of  the  variances  deviat- 
ing so  far  from  equality  by  chance  alone.  Unless  some  explanation 
connected  with  the  testing  is  forthcoming,  this  is  to  be  regarded  as  a 
danger  signal  suggesting  deterioration  of  the  sights  in  a  manner  leading 
to  wider  scatter  in  range.  The  two  sights  belonging  to  the  lot  showing 
the  variance  40  should  be  examined  for  sources  of  excessive  range  vari- 
ance such  as  loose  rate  knobs,  and  any  explanation  found  should  be  indi- 
cated in  the  space  below  the  control  charts  reserved  for  notes. 
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2.2.5.  Control  Chart  for  Deflection  Variance. — The  variance  of  deflec- 
tion errors  is  to  be  treated  in  exactly  the  same  way,  except  that  the 
control  limits  are  in  this  case  at  125.4  and  157.9.  The  yardstick  lengths 
are  the  same  as  for  the  range-error  variance  chart  (see  Fig.  3.4). 

2.2.6.  Control  Charts  Combining  Range  with  Deflection. — The  indi- 
cators of  quality  to  be  used  combining  range  and  deflection  errors  are 
denoted  by  T  with  the  following  subscripts : 

0  for  an  over-all  measure 

M  for  a  measure  based  only  on  the  mean  point  of  impact  (MPI) 

D  for  a  measure  based  on  deviations  from  the  MPI 

A  second  subscript  will  also  be  used  when  needed  to  make  the  meaning 
clear,  as  follows : 

B  for  an  individual  bomb 

F  for  all  the  bombs  dropped  in  a  flight 

S  for  the  bombs  dropped  with  the  same  sight 

L  for  the  bombs  dropped  with  the  sights  representing  a  lot 

It  will  frequently  be  convenient  to  omit  the  second  subscript. 

The  definitions  of  the  various  T"s  all  make  use  of  the  same  coeffi- 
cients a,  b,  c,  whose  values  may  be  taken  at  first,  on  the  basis  of  the 
experience  analyzed  in  Sec.  5,  as  follows: 

a  =  0.021  724         b  =  0.001  331         c  =  0.014  360 

However,  these  values,  and  the  corresponding  control-chart  limits,  should 
be  revised  periodically  as  further  experience  accumulates.  They  will 
especially  need  to  be  revised  after  any  change  in  technique  of  inspection 
has  been  in  effect  long  enough  to  provide  a  sufficient  basis  in  experience. 
Methods  of  revising  them,  and  the  other  numerical  quantities  used  in 
this  part,  are  described  in  Sec.  3.1.3. 

T20  is  defined  for  an  individual  bomb  as 

TfB  =  ar2  +  2brd  +  cd2 

and  for  a  flight,  sight,  or  lot  as  the  sum  of  the  values  of  T%B  for  the  indi- 
vidual bombs  involved.  However,  it  will  only  be  in  exceptional  cases 
that  T20B  will  actually  be  computed,  and  ordinarily  it  will  be  convenient 
to  computed  T20F  by  the  formula 

T2F  =  aSr2  +  2bSrd  +  cSd2 

where,  as  before,  S  stands  for  summation  over  all  the  bombs  dropped  on 
the  flight.  For  a  sight  or  a  lot  the  value  of  T20  is  found  by  adding  together 
its  values  for  the  several  flights  involved. 
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T2M  might  be  defined  for  a  flight  by  the  formula 

T2MF  =  N(a?l  +  2bfFdF  +  cdF) 

but  is  more  easily  computed  from  the  equivalent  formula 

T2MF  =  afFSr  +  2bfFSd  +  cdFSd 

For  a  sight  or  a  lot  the  value  of  T2M,  like  that  of  T20,  is  found  by  adding 
up  the  values  for  the  flights  involved. 

In  every  case  T20  is  greater  than  T\.     The  difference  is  defined  as 
T%.     An  algebraic  identity  shows  that 


rp2        7^2       f2 

1  DF    ~     L  OF  1  MF 


=  aS(r  -  fF)2  +  2bS(r  -  fF)(d  -  dF)  +  cS(d  -  dF) 


while  for  a  sight  or  a  lot  the  value  of  T2D  is  the  sum  of  those  for  the  flights 
involved.  Thus  in  every  case  T%  depends  only  on  the  deviations  from 
the  mean  points  of  impact  for  the  respective  flights,  whereas  T2M  depends 
only  on  these  mean  points  of  impact. 

The  direct  use  of  TM  and  T0  will  be  appropriate  only  when  biases 
such  as  those  disclosed  by  the  statistical  analysis  of  Sec.  5  have  either 
been  eliminated  or  definitely  attributed  to  the  bombsight  itself  rather 
than  anything  in  the  testing  process.  Until  such  elimination  is  achieved 
or  such  a  judgment  is  reached,  it  is  better  to  modify  TM  and  T0  so  as  to 
make  them  functions  of  deviations,  not  from  the  target,  but  from  the 
mean  point  of  impact  of  the  bombs  of  the  base  period.  Indicating  this 
correction  by  placing  a  caret  over  T,  and  using  f'  and  d'  for  the  coordi- 
nates of  the  mean  point  of  impact  of  the  bombs  of  the  base  period,  gives 

^      _  aS(r  -  f)2  +  2bS(r  -  ?'){d  -  d')  +  cS(d  -  d')2  _  T20F  +  U 

■*■  OF    ~  1  1 

1    +  A  1    +  -, 

n  n 

fa     _  N  [a(fF  -  f'Y  +  2b(fF  -  f')(dF  -  df)  +  c{dF  -  d')2] 

1  MF    —  AT 

nF  +  u 

n 
where  n'  is  the  number  of  bombs  in  the  base  period  and 

U  =  N{af'2  +  2bf'd'  +  cd'2)  -  2[af'Sr  +  b(f'Sd  +  d'Sr)  +  cd'Sd] 

The  numerical  values  obtained  from  the  260  bombs  discussed  in  Sec.  5 
give,  when  N  =  4, 

U  =  0.64917  -  0.065903£r  -  0.084129£d 
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For  a  sight  or  a  lot,  corrected  values  f%  or  f2M  are  found  by  adding 
together  the  corresponding  corrected  values  for  the  flights  involved. 
Values  of  TD  are  not  affected  by  the  correction. 

For  f2M,  T2D,  and  f2,  control  charts  are  to  be  maintained  on  arithlog 
paper  in  a  similar  form  to  those  for  s2r  and  si  (see  Figs.  3.5,  3.6,  and  3.7). 
However,  there  is  one  difference  in  form:  The  zigzag  line  on  each  T 
chart  plots  the  sums  of  the  quantities  represented  by  circles  and  crosses, 
instead  of  the  means  as  on  the  s2  charts.  This  results  in  all  the  quantities 
plotted  being  of  kinds  customary  in  statistical  theory,  and  hence  easily 
looked  up  in  probability  tables,  at  least  to  an  approximation.  A  further 
advantage  of  this  arrangement  is  that  it  renders  the  appearance  of  the  T 
charts  so  unlike  that  of  the  s  charts  that  they  cannot  be  confused. 

On  each  of  the  T  charts  there  is  for  each  of  the  probabilities  0.05  and 
0.01  an  upper  but  not  a  lower  control  limit.  The  three  yardsticks  sup- 
plied in  each  case  refer  to  differences,  on  a  logarithmic  scale,  of  sums  for 
flights,  sights,  and  lots,  respectively,  with  probability  0.05.  The  values 
of  T2  for  the  control  limits  and  yardsticks  are  given  in  the  following 
table : 


0.05  control  limit 
0.01  control  limit 
0.05  yardsticks: 

Flights 

Sights 

Lots 


T2M  or  f  1 


16.3 
21.2 

39.0 

9.6 
4.4 


41.2 
49.4 

5.8 
3.3 
2.3 


T%  or  fl 


54.4 
64.0 

4.4 
2.8 
2.0 


These  values  require  a  small  correction  of  a  purely  mathematical 
kind  based  on  research  not  yet  completed. 

2.2.7.  Illustration  of  the  Use  of  Control  Charts.* — A  glance  at  Fig.  3.1 
shows  that  all  but  two  of  the  lots  were  well  within  the  control  limits  for 
range  MPI.  Lot  9  falls  slightly  outside  the  0.05  limit  at  what  may  be 
estimated  as  the  0.04  level.  Since  there  is  a  reasonably  large  chance  of 
1  out  of  15  or  16  cases  exceeding  the  0.05  limit,  and  since  lot  9  shows  no 
shortcomings  on  the  other  charts,  this  deviation  may  be  disregarded. 

A  very  different  situation  is  presented  by  lot  2.  As  Fig.  3.1  shows,  the 
MPI  was  in  this  case  an  average  of  nearly  6  mils  short,  a  distance  far 
beyond  even  the  0.01  control  limit.  Following  this  lot  through  the  other 
control  charts,  we  observe  from  Fig.  3.2  that  its  deflection  error  in  MPI 
was  very  moderate,  and  from  Fig.  3.4  that  the  deflection  variance  was 
exceptionally  small.     From  Fig.  3.3  it  appears  that  the  range  variance 

*  This  section  is  for  illustrative  purposes  only  and  is  based  on  data  put  together 
artificially,  as  explained  in  the  footnote  in  Sec.  2.2.1  and  in  Sec.  5. 
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was  far  below  the  average.  Moreover,  in  Fig.  3.3  the  circles  and  crosses 
for  lot  2  are  all  remarkably  close  together,  indicating  consistently  low 
variance,  that  is,  high  accuracy  in  hitting  closely  around  a  point  6  mils 
short  of  the  target.  Figure  3.5  shows  that  the  range  error  in  MPI  is  too 
great  to  be  concealed  by  averaging  with  the  small  deflection  error.  Upon 
looking  at  Fig.  3.6,  it  is  apparent  that  lot  2  yielded  the  smallest  value  of 
TD  of  all  the  16  lots  plotted,  thus  demonstrating  a  rather  remarkable 
accuracy  in  hitting  the  wrong  point.  The  error  in  MPI  and  the  close 
clustering  about  it  offset  each  other  sufficiently  so  that  the  bombs  of  lot  2 
are  only  moderately  far  from  the  target  in  terms  of  To,  so  that  Fig.  3.7 
shows  nothing  remarkable. 

This  combination  of  peculiarities  might  be  brought  about  by  a 
bombardier  with  very  quick  reactions,  enabling  him  to  keep  the  airplane 
on  a  remarkably  good  course  and  to  maintain  regularity  in  his  behavior, 
but  causing  him  to  release  the  bomb  consistently  too  soon.  Such  a  man 
should  be  capable  of  development  into  an  exceptionally  accurate  bombar- 
dier if  he  studied  the  control  charts  and  particularly  his  own  errors. 

2.3.  Action  When  Results  Are  Bad 

If  in  any  case  the  zigzag  line  crosses  its  1  percent  control  limit,  an 
investigation  should  be  made  of  the  cause  of  this  anomaly,  and  the 
probable  cause  entered  in  the  space  provided  for  the  purpose  at  the 
bottom  of  the  control  chart.  If  the  investigation  includes  further  air 
testing  following  shop  adjustment,  the  same  seven  quantities  should  be 
computed  for  the  new  air  tests  and  their  significance  judged  by  means  of 
the  probability  formulas  of  Sec.  3.  Entries  of  probable  cause  should 
indicate  clearly  whether  the  excessive  bombing  errors  bringing  about  the 
anomaly  are  believed  to  indicate  a  fault  in  the  sight  or  in  some  feature 
of  the  testing  process.  Pure  chance  may  be  blamed  approximately  once 
for  every  hundred  lots  from  which  sample  sights  are  tested ;  this  allotment 
of  1  anomaly  in  100  applies  separately  to  each  of  the  control  charts. 

Some  clues  as  to  the  nature  of  the  trouble  will  be  provided  by  an 
intelligent  reading  of  the  control  chart  itself,  with  attention  to  the  con- 
sistency of  performance  of  the  two  sights  in  a  lot  and  of  the  two  flights 
with  each  sight,  and  observing  whether  the  trouble  is  in  range  or  deflec- 
tion, or  both,  and  whether  it  introduces  a  bias  into  the  MPI  or  increases 
the  dispersion  about  the  MPI.  To  facilitate  such  inquiries,  a  list  of  the 
types  of  defect  likely  to  lead  to  excessive  deviations  in  each  of  the  seven 
control-chart  quantities  may  be  used  (see  that  below,  p.  140). 

If  the  0.01  control  limits  are  exceeded  repeatedly,  the  investigation 
should  be  extended  to  the  factory.  Intensified  shop  inspection  should 
also  be  applied.  In  extreme  cases  the  remaining  sights  in  a  lot  repre- 
sented by  defective  sights  should  be  recalled  or  repaired.     If  this  happens, 


138  TECHNIQUES  OF  STATISTICAL  ANALYSIS  Chap.  3 

the  shipment  of  further  sights  prior  to  completion  of  air  testing  of  samples 
should  be  suspended  until  the  factory  has  again  established  a  satisfactory 
record;  and  the  factory  should  revert  to  the  status  of  a  new  factory,  which 
implies  that  40  consecutive  sights  should  successfully  pass  the  air  test. 
Exact  criteria  as  to  when  this  should  be  done  will  need  to  be  formulated  in 
the  future  on  the  basis  of  further  experience. 

When  the  rate  of  production  of  bombsights  is  to  be  reduced,  the 
average  showing  of  a  factory  on  the  control  charts  should  be  taken  into 
consideration,  in  comparison  with  that  of  other  factories.  The  average 
value  of  T20  is  probably  the  best  criterion  that  can  be  drawn  from  the 
coordinates  of  the  bombs  dropped  in  air  testing  for  deciding  which  factory 
should  be  eliminated. 

The  control  chart  should  not,  however,  be  looked  upon  primarily  with 
a  view  to  drastic  action.  Its  great  value,  demonstrated  in  other  indus- 
tries, is  in  detecting  trends  in  quality  and  calling  attention  to  deteriora- 
tion while  it  is  still  incipient  and  before  it  becomes  serious.  When  this 
value  is  recognized,  the  attitude  of  manufacturers  changes  from  hostility 
to  appreciation.  In  order  that  these  desirable  results  be  realized,  it  is 
essential  that  copies  of  control  charts  be  transmitted  promptly  to  manu- 
facturers, and  that  attention  be  called  to  peculiarities  revealed  by  them. 
Such  peculiarities  include  not  only  the  crossings  of  the  0.01  limit  that  call 
for  investigations  or  more  serious  action,  but  also  cases  of  exceeding 
the  0.05  limit  more  than  about  once  in  20  times,  or  of  progressive 
drifting  up  toward  the  control  limits,  or  of  excessive  variation  between  the 
performances  of  the  same  sight  or  of  two  sights  from  the  same  lot,  as 
indicated  by  the  yardsticks.  The  meaning  of  each  of  the  seven  control- 
chart  indicators  should  be  made  clear  to  all  concerned. 

The  yardsticks  are  designed  to  facilitate  visual  discernment  of  tenden- 
cies to  drift  before  the  control  limits  are  reached  and  also  to  help  in 
observing  abrupt  changes.  A  little  practice  and  familiarity  with  this 
type  of  chart  should  make  it  possible  to  detect  such  changes  more 
promptly  than  would  otherwise  be  easy.  Prompt  detection  of  trends 
and  anomalous  tendencies  can  be  of  great  value  if  the  factory  is  notified 
at  once,  since  trouble  can  often  be  found  and  corrected  in  this  way  before 
it  becomes  serious.  It  is  to  the  interest  of  the  government  and  the 
manufacturer  alike  to  prevent  serious  trouble  by  early  corrective  action 
of  this  sort,  rather  than  to  wait  for  it  to  develop  to  the  point  at  which 
bombsights  will  be  rejected. 

Of  all  the  indicators,  To  is  the  one  likely  to  give  the  sharpest  dis- 
crimination between  cases  in  which  sights  are  really  defective  or  the 
bombing  for  any  reason  poor,  and  on  the  other  hand  cases  of  pure  bad 
luck.  It  combines  the  evidence  from  all  the  others  and  can  serve  as  the 
deciding  agent  in  marginal  cases  in  which  the  others  seem  to  disagree  on 


Sec.  2.3  MULTIVARIATE  QUALITY  CONTROL  139 

the  question  whether  the  bombing  was  good  or  bad.  In  a  more  spe- 
cialized way,  TM  combines  the  evidence  of  f  and  d,  while  TD  combines 
that  of  sr  and  Sd.  It  is  for  diagnosis  of  the  trouble  rather  than  for  evi- 
dence of  its  existence  that  the  main  reliance  should  be  placed  on  the  charts 
for  means  and  variances. 

Investigations  prompted  by  excessive  control-chart  deviations  will 
be  assisted  by  a  list  of  possible  causes  under  each  of  the  seven  control- 
chart  measures.  Such  a  list  is  best  drawn  up  by  persons  in  continuing 
close  relation  to  the  manufacture,  testing,  and  use  of  the  sights  and  may 
need  modifications  from  time  to  time  with  changes  of  model  and  in  the 
testing  procedure.  The  list  below  is  purely  tentative  and  needs  to  be 
supplemented  by  further  suggestions  drawn  from  personnel  concerned 
with  manufacture,  shop  testing,  air  testing,  and  maintenance. 

In  order  that  such  suggestions  be  made  intelligently,  it  is  necessary 
that  the  personnel  involved  have  some  general  understanding  of  the 
meaning  of  each  of  the  seven  measures.  The  following  definitions  may 
be  sufficient  for  most  of  those  concerned: 

f     Range  coordinate  of  mean  point  of  impact 
d     Deflection  coordinate  of  mean  point  of  impact 
sr     A  measure  of  dispersion  in  range 
Sd     A  measure  of  dispersion  in  deflection 
TM     A  measure  of  distance  from  mean  point  of  impact  to  target, 
modified  in  accordance  with  probability  principles  to  allow  for 
the  greater  deflection  variance 
TD     A  measure  of  dispersion  about  the  MPI  combining  range  and 

deflection  errors 
To     An  over-all  measure  of  errors  of  all  kinds 

It  should  be  understood  that  each  of  the  T  measures  combines  range 
with  deflection  errors,  and  that  T0  combines  all  the  other  measures  into 
one.  If  a  particular  defect  in  a  sight,  or  in  the  testing  process,  can  affect 
only  range  errors  it  is  more  likely  to  be  detected  through  f  or  sr  than 
through  any  of  the  measures  that  include  deflection  errors.  In  the  same 
way,  if  a  particular  defect  can  affect  only  deflection  errors,  it  is  more 
likely  to  be  found  from  the  d  or  sd  chart  than  from  any  of  the  T  charts. 
However,  any  defect  that  causes  both  range  and  deflection  errors  may 
sometimes  be  detected  through  one  of  the  combined  T  measures  when  it 
would  be  impossible  to  distinguish  it  from  mere  chance  on  the  basis  of 
the  charts  of  range  or  deflection  errors  alone.  Similarly,  any  cause 
affecting  both  MPI  and  variance  can  sometimes  be  brought  out  clearly 
by  To  when  none  of  the  others  would  make  it  definite. 

In  making  up  such  a  list,  it  appears  better  to  consider  the  various 
possible  sources  of  trouble  and  mark  each  with  one  or  more  of  the  seven 


140 


TECHNIQUES  OF  STATISTICAL  ANALYSIS 


Chap.  3 


symbols,  later  rearranging  to  get  lists  in  the  form  below,  than  to  begin 
with  the  seven  control-chart  measures  and  try  to  think  of  all  the  possible 
causes. 

The  following  partial  and  tentative  list  should  be  understood  in  the 
terms  of  the  preceding  remarks.  Under  each  control-chart  measure  are 
listed  several  causes,  any  one  of  which  might  lead  to  an  excessive  value 
for  the  measure.  Each  factor  is  listed  only  under  the  measure  or  meas- 
ures most  likely  to  lead  to  its  detection,  though  it  may  also  have  minor 
effects  on  other  measures  and  will  be  reflected  in  more  comprehensive 
measures. 


Grease  on  disk 
Worn  disk 
Faulty  trail  plate 
Release  mechanism  slow  or  fast 
Bombardier's  reaction  slow  or  fast 
Loose  telescope  cable 
Faulty  sight  gyro 
Wrong  disk  speed 
Loose  altitude  knob 
Slipping  of  scale  on  altitude  drum 
Faulty    or    wrongly    adjusted    telescope 
drive  governor 


Faulty  directional  gyro 
Stabilized  sector  off  center 
Pilot  director  scale  displaced 
Pilot  director  rheostat  inaccurate 
Pilot  director  brush  worn 
Crosstrail  mechanism  inaccurate 


Erratic  release  mechanism 

Erratic  behavior  of  bombardier 

Grease  on  disk 

Roller  or  disk  not  smooth 

Roller  not  accurately  centered 

Loose  telescope  cable 

Loose  range  knobs 

Loose  altitude  knob 

Faulty  sight  gyro 

Faulty  telescope  drive  motor 


Loose  course  knobs 
Faulty  directional  gyro 
Stabilized  sector  off  center 
Slow  reaction  by  airplane 
Slow  reaction  by  pilot 


TM 

of 


factors    affecting 


Any    combination 
f  and  d 

Wrong  altitude 

Wrong  air  speed 

Faulty  direction  of  optical  axis  of  tele- 
scope 

Sight  gyro  inaccurately  leveled,  perhaps 
because  of  inaccurate  bubbles 


of 


factors    affecting 


Any    combination 

sr  and  Sd 
Loose  altitude  knob 
Sticky  bearings 
Variable  winds 
Poor  visibility 
Bumpy  air 

To 

Any  combination  of  factors  affecting  two 
or  more  of  the  other  measures 

General  slipshodness  or  lack  of  skill  in 
either  manufacture  or  testing 

Variable  winds 

Faulty  sight  gyro 


2.4.  Experimentation 

This  plan  may  readily  be  adapted  for  efficient  combination  with 
experiments  for  determining  the  relative  excellence  of  different  models 
of  sights  and  types  of  bombing,  or  for  investigating  other  questions  relat- 
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ing  to  bombing.  Considerable  economy  in  the  total  amount  of  bombing 
required  should  be  obtainable  by  a  proper  design  for  experimenting 
simultaneously  on  various  types  of  equipment  and  technique  while  at 
the  same  time  carrying  out  the  routine  test  bombing  with  full  accuracy. 
The  amount  of  bombing  required  will  be  greater  than  for  the  routine 
testing  alone  or  for  the  experiment  alone,  but  less  than  the  sum  of  the 
two. 

To  ensure  valid  results  from  such  combined  enterprises,  it  is  important 
to  verify  before  the  experiment  is  carried  out  that  it  satisfies  the  criteria 
of  the  modern  statistical  theory  of  design  of  experiments.  It  may  be 
difficult  or  impossible  to  salvage  any  useful  information  from  the  results 
of  an  improperly  designed  experiment.  Many  designs  are  possible  for 
efficient  combined  experiments.  The  selection  of  a  particular  design  is 
best  made  in  relation  to  the  objects  of  a  particular  experiment. 

As  an  illustration  of  a  simple  type  of  combination  of  experiment 
with  routine  testing,  suppose  that  some  new  type  of  auxiliary  equipment 
is  to  be  examined  (1)  for  its  effect  on  the  range  component  f  of  mean  point 
of  impact,  (2)  as  to  its  effect  on  the  standard  deviation  sr  in  range,  and 
(3)  as  to  its  effect  on  general  accuracy.  Instead  of  two  flights  per  sight, 
four  flights  might  now  be  made  for  a  time  with  each  sight.  Two  of 
these  flights  are  to  be  on  one  day  of  a  4-day  period  and  two  on  another. 
Of  the  two  flights  with  a  sight  on  the  same  day,  one,  chosen  by  tossing  a 
coin,  is  to  carry  the  new  equipment  and  the  other  is  not.  A  valid  analysis 
of  variance  of  range  can  be  carried  out  from  the  results  to  answer 
question  (1),  and  analogous  procedures  to  answer  (2)  and  (3)  by  means 
of  sr  and  To,  respectively,  though  the  analysis  for  T0  must  either  involve 
approximations  or  wait  for  mathematical  research  on  the  subject  to 
progress  a  little  further  than  the  point  heretofore  reached.  The  accuracy 
of  determination  of  the  differences  made  by  the  new  equipment  will  be 
fully  as  great  as  if  the  entire  enterprise  were  devoted  wholly  to  this 
question.  From  this  standpoint  the  information  relevant  to  the  routine 
testing  program  is  obtained  free  of  charge. 

For  the  rating  of  men  and  equipment  routinely  employed  in  the 
testing  of  bombsights,  it  is  possible  to  utilize  the  results  without  any 
modification  of  design  other  than  objective  randomization  in  the  assign- 
ments of  the  various  flights.  This  randomization  should  be  restricted 
by  conditions  of  balance  and  orthogonality  in  order  to  have  the  most 
accurate  ratings  possible  and  to  simplify  calculations,  but  these  condi- 
tions are  not  essential.  One  simple  type  of  rating  would  be  merely  the 
average  of  the  values  of  Tl  for  the  flights  involved.  However,  the  use 
of  new  types  of  equipment  or  personnel  not  used  in  the  base  period  from 
which  the  control  limits  were  determined  will  invalidate  these  control 
limits  and  should  not  be  introduced  into  the  air  testing  of  bombsights 
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without  a  suitable  modification  of  design.  Similarly,  altitude  and  other 
conditions  attending  the  test  bombing  should  be  kept  as  nearly  as  possible 
the  same  as  in  the  base  period  until  a  deliberate  decision  is  reached  that  a 
change  should  be  made  and  maintained  over  a  considerable  period.  In 
such  an  event,  it  will  be  necessary  to  begin  over  again  the  process  of 
accumulating  data  for  new  control-chart  limits. 

3.  FORMULAS  AND  CONSIDERATIONS  AFFECTING  REVISIONS  OF 

THE  PLAN 

3.1.  General  Considerations 

3.1.1.  Occasions  for  Revision. — New  information  and  changing  condi- 
tions will  from  time  to  time  call  for  revisions  of  the  sampling  plan  specified 
in  Sec.  2.  Among  the  changes  likely  to  be  appropriate  are  alterations 
in  the  size  of  lots,  the  proportion  of  a  lot  that  is  to  be  air-tested,  the  number 
of  flights  with  each  sight,  the  number  of  bombs  dropped  per  flight,  and 
the  estimates  of  the  variances  and  covariances  of  range  and  deflection 
errors  from  which  control  limits  and  yardsticks  are  calculated. 

Another  kind  of  change  that  may  be  developed  is  the  use  of  improved 
types  of  control  charts  for  shop  inspection,  together  with  the  integration 
of  shop-inspection  procedures  with  air  testing  in  a  more  systematic  way. 
For  such  improvements  the  same  statistical  methods  that  are  here 
recommended  for  air  testing  may  be  applied  to  shop  testing,  or  to  a 
combination  of  shop  and  air  testing.  Such  a  development  would  mean 
that  instead  of  the  two  types  of  measurement,  range  and  deflection 
errors,  it  would  be  necessary  to  take  account  of  a  larger  number  of 
measures  on  each  sight  and  to  make  appropriate  combinations  of  the 
various  measures.  The  mathematical  methods  of  multivariate  statistical 
analysis  are  capable  of  extension  with  a  little  research  to  much  more 
general  situations  involving  coordinated  testing  at  different  stages. 
For  this  reason,  and  for  the  sake  of  other  analogous  applications,  certain 
formulas  will  attain  greater  generality  by  being  written  in  a  form  suitable 
for  ready  extension  to  more  than  two  variables.  Consequently,  xL  and  x2 
will  sometimes  be  used  instead  of  r  and  d  for  range  and  deflection  errors, 
and  subscripts  will  be  used  which  take  only  the  values  1  and  2  in  the 
present  chapter  but  which  may  be  understood  to  take  other  values  as  well 
in  cases  in  which  more  measurements  are  made.  For  example,  in  place 
of  the  a,  b,  and  c  introduced  in  Sec.  2.2.6,  when  generality  is  sought  In,  Z12, 
and  l22,  respectively,  will  be  used,  so  that  such  a  formula  as 

£2  =  ar*  +  2brd  +  cd2 
may  also  be  written 

T*  =  lnxl  +  2l'12x1x2  +  l22x\ 
or,  more  compactly, 

1       ==    2j  ZjiijXiXj 
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with  the  understanding  that  the  first  2  stands  for  summation  with  respect 
to  i  and  the  second  2  with  respect  to  j.  In  this  chapter,  the  summations 
will  be  over  the  values  1  and  2  only,  but  in  the  extensions  envisaged 
they  may  be  over  any  set  of  values.  Our  results  will  be  in  such  forms  as 
to  be  readily  extended  to  the  more  general  cases. 

Still  another  type  of  revision  of  this  and  other  sampling  plans  is 
likely  to  be  required  for  sequential  analysis,  by  which  the  decision  as  to 
the  number  of  tests  to  be  made  is  not  fixed  at  the  beginning  of  the  testing 
but  is  postponed  and  determined  by  the  gradually  accumulating  data. 
Considerable  economies  are  possible  by  use  of  a  Avell-worked-out  system 
of  sequential  testing.  Methods  of  applying  sequential  analysis  to  the 
particular  problem  of  this  chapter  have  not  yet  been  developed.* 

3.1.2.  Numbers  of  Sights,  Flights,  and  Bombs. — In  the  plan  suggested 
in  Sec.  2,  the  proportion  of  sights  to  be  sent  to  the  proving  ground  was 
taken  as  one-tenth  simply  because  this  has  been  the  proportion  sent  in  the 
past  and  because  the  shop-testing  facilities  appear  to  be  adequate  for 
dealing  with  this  proportion.  A  continuing  rating  of  the  inspection  at  the 
factories  in  comparison  with  the  results  of  the  shop  and  air  tests  at  the 
proving  ground,  and  a  consideration  of  the  needs  of  the  service,  might  lead 
to  the  decision  to  send  either  a  larger  or  a  smaller  proportion  of  sights  to 
the  proving  ground.  If  it  should  be  decided  to  increase  the  proportion  of 
sights  sent  to  15  or  20  percent  of  the  total,  this  could  be  accomplished  by 
sending  3  or  4  sights  from  each  lot  of  20  with  a  minimum  of  disturbance  of 
the  sampling  plan.  If,  on  the  other  hand,  it  should  be  decided  that  the 
inspection  at  the  factory  is  more  adequate  than  has  been  felt  to  be  the  case 
in  the  past,  and  a  smaller  proportion  of  sights  is  to  be  sent,  the  best  way 
to  do  this  would  be  to  increase  the  size  of  the  lot.  It  is  desirable  to 
have  the  lot  large  enough  so  that,  with  the  proportion  adopted,  at  least 
two  sights  chosen  at  random  from  each  lot  shall  be  sent  to  the  proving 
ground.  The  reason  for  this  is  that  there  should  be  available  a  con- 
tinuing measure  of  variation  within  lots,  and  this  requires  at  least  two 
sights  from  each  lot.  Such  variation  is  displayed  by  the  control-chart 
scheme  of  Sec.  2  along  with  the  measures  of  average  quality. 

If  the  number  of  sights  taken  from  each  lot  should  become  larger, 
measures  of  this  intra-lot  variation  could  be  calculated  easily  and  accu- 
rately, but  the  graphic  portrayal  by  circles  and  crosses  might  not  be  so 
clear,  and  vertical  segments  proportional  to  calculated  measures  of 
variation  might  need  to  be  used  instead.  If  collateral  experimentation 
on  methods  of  bombing,  auxiliary  equipment,  and  so  forth,  is  to  be  com- 
bined efficiently  with  test  bombing,  there  is  an  advantage  in  having  more 

*  See  Chap.  17  for  a  brief  account  of  sequential  analysis.  For  fuller  accounts  see, 
on  applications,  Statistical  Research  Group,  Sequential  Analysis,  and  on  theory,  Wald, 
"Sequential  Tests." 
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than  two  sights  chosen  at  random  from  each  lot,  as  the  larger  numbers 
make  possible  the  design  of  more  efficient  experiments.  Against  this 
consideration  is  to  be  set  the  fact  that  if  the  lots  are  large,  as  would  be 
implied  by  having  a  large  sample  constituting  a  fixed  percentage  of  a  lot, 
there  might  be  an  excessive  delay  at  the  factory  in  the  accumulation  of 
enough  sights  to  make  a  lot  so  that  the  random-sampling  process  could 
be  applied.  This  objection  to  large  lots  would  apply  particularly  where 
the  rate  of  production  of  a  factory  is  rather  small.  This  matter  is 
touched  upon  further  in  Sec.  4.2.  The  proportion  of  sights  received  at 
the  proving  ground  to  be  air-tested  has  been  arrived  at  not  by  calcula- 
tion but  somewhat  arbitrarily  from  a  general  survey  of  the  practices 
prevailing  in  the  past  and  the  opinions  of  those  most  closely  in  touch 
with  various  phases  of  the  situation. 

It  is  essential  that  at  least  two  nights  be  made  with  each  sight  in 
order  that  a  measure  of  variation  due  to  the  weather  and  chance  factors 
may  be  available.  The  number  of  bombs  dropped  per  flight  has  been 
taken  to  be  four,  because  that  was  the  most  commonly  used  number  in  the 
period  just  before  this  study  was  begun.  However,  since  six  ore-filled 
test  bombs  can  be  dropped  on  a  flight  and  since  additional  accuracy  is 
gained  by  this  practice,  it  may  prove  desirable  to  modify  the  plan  to 
fit  this  or  some  other  number  N  of  bombs  dropped  on  a  flight.  Increased 
accuracy  can  also  be  obtained  by  increasing  the  number  of  flights  per 
sight.  In  the  light  of  data  accumulating  under  the  sampling  plan,  the 
question  of  what  combination  of  numbers  of  flights  per  sight  and  of 
bombs  per  flight  gives  the  most  economical  method  of  obtaining  the 
requisite  accuracy  needs  to  be  investigated  from  time  to  time;  or  (what 
is  somewhat  the  same  thing)  what  combination  gives  the  maximum 
accuracy  obtainable  with  the  resources  available.  Such  a  combination 
can  only  be  arrived  at  by  a  study  of  comparative  costs  in  conjunction 
with  the  relation  of  variance  between  flights  to  variance  between  bombs 
on  the  same  flight. 

3.1.3.  Revision  Based  on  New  Experience. — Revision  of  the  variances 
and  covariance  of  range  and  deflection  errors  may  need  to  be  made 
because  of  changes  in  techniques  of  bombing;  it  should  eventually  be 
made  anyhow  in  order  to  have  more  accurate  estimates  of  these  param- 
eters than  are  at  present  available.  From  a  long  series  of  reasonably 
homogeneous  control-chart  data,  the  variance  of  range  s*  may  be  esti- 
mated as  the  mean  of  s2r  for  the  recorded  flights,  and  the  variance  of 
deflection  si  similarly.  There  is  a  need  also  for  an  estimate,  which  we 
denote  by  srd,  of  the  covariance  of  range  and  deflection  errors.  For  a 
single  flight  with  N  bombs  such  an  estimate  would  be 

S(r  -  f)(d  -  d)  =  Srd  -  fSd 
N  -  1  N  -  1 
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but  the  average  of  these  estimates  over  the  long  series  should  be  adopted. 
(If  N  varies  among  the  flights,  the  ratio  of  the  sum  of  the  numerators  to 
the  sum  of  the  denominators  should  be  used.)  It  is  not  necessary 
actually  to  compute  these  sums  of  products  if  all  the  calculations  going 
into  the  control  chart  have  been  correct  and  are  available,  since  the 
products  have  already  been  computed  in  the  process  of  obtaining  values 
of  T\     Indeed  (Sec.  2.2.6), 

T2DF  =  aS(r  -  rFy  +  2bS(r  -  fF)(d  -  dF)  +  cS(d  -  dF)2 

If  we  sum  for  m  nights,  each  with  N  bombs,  and  use  the  expressions  for  s2r 
and  sj,  we  may  solve  and  obtain  as  our  new  estimate  of  co variance 


1  TO  2  2 

The  number  of  degrees  of  freedom  derived  from  each  flight  is  N  —  1,  and 
therefore  from  the  m  flights  in  the  sample  it  is  m(N  —  1).  We  shall  put 
n  =  m(N  —  1).  This  important  number,  which  must  be  used  in  the 
determination  of  control  limits,  yardstick  lengths,  and  various  tests  of 
statistical  significance,  should  be  recorded  carefully. 
The  revised  values  of  the  coefficients  for  T2  are 


S2i  q£      _____       r»-<  o-'  o-*      —       O^  O^  Q"      Q" 

rbd  drd  ^r^d  drd  dr*d  drd 

The  nature  of  these  formulas  will  perhaps  be  clearer  by  reference  to  a  more 
general  case.  If,  instead  of  range  and  deflection  errors,  we  were  meas- 
uring a  considerable  number  of  features  of  each  sight  and  combining 
these  into  one  over-all  indicator,  we  should  use  for  T2  the  last  formula 
on  p.  142,  with  each  coefficient  Uj  equal  to  the  cofactor  of  the  covariance 
between  the  ith.  and  jth  measures  in  the  determinant  of  such  co  variances, 
divided  by  this  determinant.  In  the  formulas  for  a,  b,  and  c  above,  the 
common  denominator  is  the  covariance  determinant,  while  the  numer- 
ators are  the  cofactors  of  the  elements  of  this  determinant.* 

After  two  or  three  years  of  test  bombing  in  a  fairly  uniform  manner, 
it  should  be  possible  to  obtain  useful  measures  of  seasonal  variation  to 
be  applied  as  corrections.  Even  without  this  delay,  it  may  be  possible 
by  a  special  statistical  study  to  work  out  corrections  of  the  variances  to 
allow  for  the  weather.  The  effect  of  any  such  corrections  upon  the 
number  n  of  degrees  of  freedom  should  be  noted. 

*  The  reader  unfamiliar  with  determinants  can  readily  learn  enough  about  them  to 
carry  out  these  instructions  from  the  chapter  on  determinants  in  any  book  on  college 
algebra,  or  from  Chap.  VII,  "Determinants,"  in  Crum  and  Schumpeter,  Rudimentary 
Mathematics.  In  the  determinant  of  the  covariances  sa,  the  elements  corresponding 
with  i  =  j  are  simply  variances ;  thus  S22  =  sf. 
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3.1.4.  Assumptions  Involved. — The  control  limits  on  the  charts  sup- 
ply a  test  of  the  hypothesis  that  the  sights  of  a  particular  lot  are  as  good 
as  the  sights  used  in  the  base  period.  The  limits  are  so  determined  that, 
if  the  hypothesis  is  true,  the  0.05  limit  will  in  the  long  run  be  exceeded 
approximately  once  for  every  20  lots,  and  the  0.01  limit  approxi- 
mately once  for  every  100  lots.  The  functions  plotted  on  the  control 
charts  have  been  chosen  in  accordance  with  the  findings  of  statistical 
theory  as  those  most  likely  to  lead  to  these  limits  being  exceeded  if  the 
sights  tested  are  in  fact  worse  than  those  of  the  base  period. 

These  statements  are,  to  be  sure,  based  on  a  number  of  assumptions, 
but  the  assumptions  appear  reasonable.  It  is  assumed  that  the  different 
bomb  drops  are  statistically  independent  or  that  corrections  are  applied 
that  take  care  of  any  lack  of  independence.  If  the  randomization  proce- 
dure prescribed  in  Sec.  2  is  followed,  and  especially  if  suitable  corrections 
are  made  for  weather,  this  assumption  should  fit  the  facts  closely  as 
regards  test  bombing  done  under  the  plan.  Whether  true  independence 
can  be  ascribed  to  the  actual  bombings  used  in  calculating  the  initial 
control  limits  may  be  questioned,  since  no  scheme  of  objective  randomiza- 
tion was  in  effect  at  the  time  and  since  no  corrections  were  made  for 
weather  apart  from  the  selective  effect  of  there  being  no  bombing  in  the 
worst  weather.  It  is  the  writer's  impression,  based  on  inspection  of  the 
procedures  and  records,  that  lack  of  independence  in  these  bombings  is 
not  such  as  to  make  any  great  difference  in  the  control  limits.  The 
weather  factor  may  be  more  serious,  since  the  test  bombings  of  this  base 
period  were  all  made  in  November  and  December.  However,  the  charts 
kept  at  the  proving  ground  of  earlier  bombings  under  different  conditions 
failed  to  disclose  any  obvious  evidence  of  any  marked  seasonal  variation 
in  results. 

It  is  assumed  that  the  bombing  errors  in  range  and  deflection  have 
a  bivariate  normal  distribution.  This  assumption  has  a  certain  a  priori 
justification  in  the  notion  of  the  errors  resulting  from  accumulation  of  a 
large  number  of  independent  errors  and  has  also  an  approximate  a 
posteriori  verification  in  various  records  of  bombing.  There  is,  to  be  sure, 
a  slight  skewness  in  range  errors,  which  can  be  explained  by  a  small 
proportion  of  bombs  having  defects  that  cause  them  to  fall  short  with  a 
bias  peculiar  to  this  small  group.  However,  the  deviations  from  nor- 
mality appear  to  be  so  small  as  to  have  no  appreciable  influence  on  the 
final  results,  so  far  as  can  be  judged  from  general  statistical  considera- 
tions. Certainly  there  is  no  similarity  between  the  recorded  curves  of 
bomb  distribution  and  the  sharp-peaked  theoretical  ones  for  which 
mean  deviations  provide  efficient  measures  of  dispersion. 

No  assumption  is  made  in  this  chapter  either  that  correlation  of  range 
with  deflection  errors  exists  or  that  it  does  not,     The  records  used  in 
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Sec.  5  indicate  that  little  or  no  correlation  exists,  though  other  records 
consisting  of  compilations  of  huge  quantities  of  heterogeneous  material 
do  superficially  indicate  high  correlations.  The  high  correlations  in 
these  cases  can  probably  be  ascribed  to  the  mixing  of  records  of  bombings 
at  many  different  places  and  times,  with  various  kinds  of  equipment,  with 
personnel  of  widely  different  degrees  of  skill,  together  with  faulty  records 
whose  complete  compilation  proved  impossible.  Even  if  there  were  a 
substantial  correlation,  it  is  not  clear  that  this  would  be  a  serious  detri- 
ment if  the  variances  of  the  errors  were  moderate.  The  use  of  T2  in  the 
plan  is  equally  valid  whether  correlation  exists  or  not. 

In  selecting  the  T  functions,  it  is  assumed  that  no  bias  in  any  special 
direction  is  to  be  guarded  against,  but  that  equal  precautions  are  to 
be  taken  against  biases  in  all  directions.  Proofs  of  the  optimum  proper- 
ties of  T  have  been  given  by  Hsu  and  by  Wald.  * 

It  is  assumed  also  that  exact  probabilities  are  to  be  represented 
by  the  control  limits,  free  from  approximations  of  unknown  degrees  of 
accuracy.  This  consideration  eliminates  some  plausible  functions,  such 
as  the  mean,  or  mean  square,  of  the  actual  distances  of  the  bombs  from 
the  target.  While  these  functions  have  an  intuitive  appeal  and  seem 
at  first  glance  to  represent  the  real  value  of  the  bombsight  because 
closeness  of  bomb  to  target  is  the  criterion  of  effectiveness  in  action,  they 
are  not  suitable  for  use  with  proof  bombing.  They  are  inefficient 
even  for  large  samples  by  comparison  with  T%,  and  for  small  samples 
they  suffer  from  the  impossibility  of  determining  exact  probabilities 
relative  to  them  (see  Sec.  4.8). 

3.2.  Means 

3.2.1.  Control  Limits  for  Means. — The  outer  control  limits  for  the  f 
chart  are  so  chosen  that,  if  the  distribution  of  range  errors  is  the  same  as 
in  the  base  period,  the  probability  is  0.01  that  the  mean  f  of  the  range 
errors  found  with  the  sights  of  any  particular  lot  should  be  represented  by 
a  point  falling  outside  these  limits.  The  inner  control  limits  are  deter- 
mined in  the  same  way  except  that  the  probability  is  0.05.  The 
heights  of  these  lines  in  terms  of  the  scale  of  mils  on  the  chart  are  given 
by  the  formula 


f\  ±  ^'  \n>  +  ghN 
in  which  the  following  notation  is  used: 

f'  =  mean  of  range  errors  in  base  period.     (Errors  "over"  are  con- 
sidered as  positive,  errors  "short"  as  negative.) 
sj.  =  estimated  standard  deviation  of  range  errors  in  base  period, 
*  Hsu,  " Generalized  T";  Wald,  "Tests  Concerning  Several  Parameters." 
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consisting  of  the  square  root  of  the  ratio  of  a  sum  of  squares 

with  n  degrees  of  freedom  to  n 
n'  =  number  of  range  errors  averaged  in  base  period  to  obtain  f- 
N  =  number  of  bombs  dropped  per  flight  in  the  period  for  which 

the  control  chart  is  to  be  used 
g  =  number  of  flights  per  sight  in  the  period  for  which  the  control 

chart  is  used 
h  =  number  of  sights  per  lot  in  the  period  for  which  the  control 

chart  is  used 
e  =  0.01  for  outer  control  limits,  0.05  for  inner  limits 
U  =  value  of  ratio  of  deviation  in  a  mean  to  its  estimated  standard 

error  (with  n  degrees  of  freedom)  exceeded  in  absolute  value  with 

probability  e 

With  the  additional  notation 

d'  =  mean  of  deflection  errors  in  base  period   (errors  to  the  right 

considered  positive,  to  the  left  as  negative) 
s'd  =  estimated  standard  deviation  of  deflection  errors  in  base  period 

we  have  the  parallel  formula  for  control  limits  for  dL 

The  base  period  considered  in  Sec.  5  yields  the  following  numerical 
values : 

?'  =  1.3450  mils  d'  =  2.8046  mils 
4  =  6.8040  mils  s'd  =  8.3686  mils 
ri  =  260  n  =  256 

According  to  the  specifications  in  Sec.  2.1,  g  =  h  =  2  and  N  =  4.  Values 
of  te  can  be  found  in  tables  of  the  Student  distribution.*  However, 
these  tables  do  not  extend  to  the  value  n  =  256.  For  such  large  values 
te  may  be  obtained  as  the  first  two  or  three  terms  of  the  asymptotic 
series 


U  =  x 


1  +  *2  +  1  +  (a2  +  3)(5x2  +  1) 
4n  96ft2 


where  x  is  the  unit  normal  deviate  exceeded  in  absolute  value  with 
probability  e.f     We  thus  find  for  n  =  256 

*  See  Fisher  and  Yates,  Statistical  Tables,  Table  III.  This  shows  te  to  3  decimal 
places  for  e  =  0.001,  0.01,  0.02,  0.05,  and  0.1  to  0.9  by  steps  of  0.1,  for  n  =  1  to  30  by 
steps  of  1,  40,  60,  120,  and  ».  Also,  see  Merrington,  "Percentage  Points  of  the 
^-Distribution."  This  shows  te  to  5  significant  figures  for  e  =  0.005,  0.01,  0.025, 
0.05,  0.10,  0.25,  and  0.50,  for  the  same  values  of  n  used  by  Fisher  and  Yates. 

t  Values  of  x  may  be  found  from  Table  1.1  in  Chap.  1,  provided  e/2  is  used  in 
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*o.oi  =  2.59514         ^0.05  =  1.96925 

Substituting  these  values  gives  control  limits  for  fL  as  follows: 

0.01  limits:     1.3450  ±  4.5481  =  5.8931  and  -3.2031 
0.05  limits:     1.3450  ±  3.4512  =  4.7962  and  -2.1062 

For  dL,  in  the  same  way,  we  find 

0.01  limits:     2.8046  ±  5.5940  =  8.3986  and  -2.7894 
0.05  limits:     2.8046  ±  4.2448  =  7.0494  and  -1.4402 

3.2.2.  Asymmetry  of  Control  Limits  and  Bias  in  Base  Period. — It  may 

seem  strange  that  the  control  limits  are  not  symmetrical  with  respect 
to  the  zero  line  on  the  f  and  d  charts,  since  sights  whose  use  gives  mean 
points  of  impact  close  to  the  target  seem  more  desirable  than  those 
yielding  more  remote  MPI's.  A  like  question  arises  in  connection  with 
TM  and  T0.  However,  it  should  be  emphasized  that  our  comparisons 
are  not  with  a  theoretical  ideal  but  with  the  actual  performance  of  a 
base  period  in  which  the  testing  procedure  is  presumed  to  be  comparable 
with  that  in  the  period  for  which  the  control  chart  is  to  be  used.  The 
material  on  which  our  records  are  based  sIioavs  biases  in  both  range  and 
deflection  errors  in  excess  of  any  that  could  reasonably  be  ascribed  to 
chance.  Whether  these  biases  resided  in  the  sights,  in  the  habits  of  the 
test  bombardiers  or  pilots,  in  some  other  feature  of  the  test  bombing, 
or  in  the  observation  of  the  results  we  do  not  know.  The  sights  used 
were  accepted.  If  the  bias  was  the  fault  of  the  sights,  this  fact  should 
be  ascertained  by  an  ad  hoc  inquiry  and  the  manufacturing  design  changed 
accordingly.  If  the  bias  was  a  result  of  something  in  the  test  bombing 
other  than  the  sights,  and  if  this  condition  continues  in  the  period  for 
which  the  control  charts  are  to  be  used,  it  is  necessary  that  the  control 
limits  be  asymmetrical  with  respect  to  the  zero  line  in  order  to  avoid 
wholesale  false  alarms  regarding  sights  that  are  really  quite  satisfactory 
but  with  which  the  test  bombs  fall  moderately  farther  over  and  to  the 
right  than  the  average.  If  the  proof-bombing  procedure  is  such  as  to 
create  the  observed  bias,  a  sight  should  be  labeled  unsatisfactory  much 
more  readily  for  bomb  errors  to  the  left  and  short  than  for  errors  of  the 
same  magnitude  in  the  direction  in  which  the  bomb  errors  are  usually 
made.  Use  of  the  asymmetrical  limits  is  equivalent  to  applying  a 
correction  to  each  bomb  coordinate  to  compensate  for  the  bias  observed  in 
the  base  period. 

entering  the  table.  The  value  of  x  for  e  =  0.01  is  2.57582930,  and  for  e  =  0.05  it  is 
1.95996398. 

The  series  for  te  in  terms  of  x  is  equivalent  to  a  series  for  x  in  terms  of  te  given  by 
Hotelling  and  Frankel,  "  Transformation  of  Statistics,"  and  by  Goldberg  and  Levine, 
"t  and  x2";  the  first  two  terms  are  also  given  by  Peiser,  "Significance  Levels." 


150  TECHNIQUES  OF  STATISTICAL  ANALYSIS  Chap.  3 

If  a  change  in  bombing  procedures,  or  in  methods  of  observing  the 
positions  at  which  the  bombs  fall,  results  in  elimination  or  alteration  of 
the  observed  bias,  the  control  limits  should  be  changed  accordingly. 
Until  this  is  done  they  should  be  based  on  experience. 

If  an  investigation  leads  to  the  conclusion  that  freedom  from  bias 
has  been  achieved  in  the  test  bombing  and  is  likely  to  continue,  the  control 
limits  for  f  and  d  should  be  symmetrical  about  zero,  at  the  same  distances 
as  from  the  central  lines  previously  indicated.  At  the  same  time,  fM 
should  be  replaced  by  TM,  and  T0  by  T0,  both  as  to  calculated  values  and 
in  the  labeling  of  the  control  chart. 

Other  considerations  involving  the  fallibility  of  the  base-period  sample 
arise  in  connection  with  the  effect  on  the  distance  between  upper  and 
lower  control  limits  of  the  number  of  bombs  in  the  base  period.  The 
reciprocal  of  this  number  n'  is  one  of  the  terms  under  the  radical  sign  in 
the  formulas  for  the  control  limits,  so  that  if  the  control  limits  are  based 
on  a  period  in  which  very  few  bombs  were  dropped,  the  limits  will  be 
relatively  far  apart.  In  such  a  period  t€  will  also  be  slightly  larger  than 
if  numerous  bombs  were  dropped  and  will  push  the  control  limits 
still  farther  apart.  This  fact  is  a  consequence  of  our  ability  to  make 
clear  comparisons  only  with  past  observations.  If  the  preliminary 
sample  with  which  we  compare  a  new  sample  is  small,  the  possibility  of 
establishing  a  difference  when  one  really  exists  is  diminished. 

The  effect  of  n'  on  the  control  limits  is  rather  slight  for  numbers  of 
any  such  orders  of  magnitude  as  we  have  been  considering.  There 
seems  no  reason  here  why  the  control  limits  should  not  be  revised  after 
every  200  or  300  bombs,  if  it  is  thought  that  fundamental  conditions 
have  changed  in  such  an  interval.  A  more  serious  objection  to  such 
frequent  revision  for  the  present  is  the  lack  of  exact  probabilities  when  n 
is  small. 

3.2.3.  Yardsticks  for  Differences  of  Means. — Unlike  the  control 
limits,  these  yardsticks  are  unaffected  by  the  bias  in  the  MPFs  of  the  base 
period.  The  lengths  of  the  yardsticks  for  means  of  flights,  sights,  and 
lots  are,  respectively, 


^0.05  yljj  S'  ^0.05  A/Ty  S'  ^.05  'J^TKt  «' 


2^ 

ghN 


where  s'  is  to  be  replaced  by  s'r  for  the  range  yardsticks  and  by  sd  for  the 
deflection  yardsticks,  and  the  value  of  £0.o5  is  to  be  determined  with 
reference  to  the  value  of  n  underlying  the  estimates  of  variance  in  the 
base  period.  For  g  =  h  =  2,  N  —  4,  and  the  same  values  of  s'r  and  s'd 
used  previously  on  the  basis  of  experience,  these  expressions  take  the 
following  values,  which  are  the  lengths  of  the  yardsticks  shown  in  Figs. 
3.1  and  3.2: 
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Flights 

Sights 

Lots 

Range 

9.4741 
11.6527 

6 . 6992 
8.2397 

4.7370 

Deflection 

5.8263 

3.3.  Variances 

3.3.1.  Control  Limits  for  Variances. — The  control  limits  on  the  charts 
for  s2,  and  sj  are  fixed  in  accordance  with  the  distribution  of  the  ratio  cf 
two  independent  estimates  of  the  same  variance.  This  ratio  is  usually 
denoted  by  F,  since  R.  A.  Fisher  was  the  first  to  work  with  the  distribu- 
tion and  to  show  its  extensive  usefulness  in  statistical  work  of  many 
kinds.* 

To  obtain  the  0.01  control  limit  for  s2.,  we  put  s2/s'2  =  F,  and  obtain 
from  the  table  of  1  percent  points  the  value  of  F  corresponding  to 


nx  ~=  gh(N  -  1) 


and 


n2  =  n 


From  this  value  of  F  and  the  value  s'2  found  from  the  base  period  we 
obtain  s2,  the  upper  control  limit.     According  to  the  plan  specified  in  Sec. 

12.     For  the  base  period  of 


=  h  =  2  and  N  =  4,  so  that 


nx 


2.1, 

Sec.  5,  n  —  256  and  s'r2  =  46.295.  From  the  tables  it  is  found  that,  for 
hx  —  12  and  n2  =  256,  the  1  percent  point  of  F  is  approximately  2.2544. 
Multiplying  this  by  46.295  gives  the  value  104.4  as  the  1  percent  control 
limit  for  s2r. 

The  numerical  values  for  the  control  limits  on  the  s2  scale  of  Figs. 
3.3  and  3.4  are  found  in  this  way  and  given  below: 


Range .... 
Deflection 


0.01  limit       0.05  limit 


104.4 
157.9 


82.9 
125.4 


These  control  limits  take  full  account  of  the  fact  that  the  variances 
derived  from  the  preliminary  sample  are  not  exactly  those  of  the  popula- 
tion but  are  based  on  256  degrees  of  freedom  instead  of  an  infinite  number. 
If  the  same  variance  estimates  had  been  obtained  from  a  preliminary 
sample  of,  say,  1,000,000  degrees  of  freedom,  the  control  limits  for  range 
variance  would  have  been  reduced  to  101.2  and  81.1,  with  proportionate 
reductions  in  the  case  of  deflection. 

*  For  a  table  of  F,  see  Fisher  and  Yates,  Statistical  Tables,  Table  V,  where  F  is 
called  "the  variance  ratio."  This  table  contains  a  footnote  which,  since  it  may  be 
misinterpreted,  should  be  disregarded  in  following  the  instructions  in  this  chapter. 
A  more  extensive  table  of  F  is  given  by  Merrington  and  Thompson,  "Percentage 
Points  of  the  Inverted  Beta  (F)  Distribution." 
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3.3.2.  Yardsticks  for  Variances. — The  yardsticks  on  the  variance 
charts  have  no  relation  whatever  to  the  base  period  but  depend  only  on 
the  numbers  of  degrees  of  freedom  in  the  current  variance  estimates 
that  are  to  be  compared.  Their  lengths  are  the  same  for  the  charts  of 
range  variance  and  of  deflection  variance.  The  vertical  distance  between 
two  plotted  points  has  probability  0.05  of  exceeding  the  length  of  the 
appropriate  yardstick  if  the  variances  plotted  are  obtained  from  samples 
from  the  same  hypothetical  infinite  population.  If  F  is  the  ratio  of  two 
such  sample  variances  with  equal  numbers  of  degrees  of  freedom,  log  F 
has  a  symmetrical  distribution  extending  from  minus  to  plus  infinity. 
The  value  of  log  F  corresponding  to  the  length  of  the  yardstick  is  such 
that  its  probability  of  being  exceeded  in  absolute  value  is  0.05.  Hence, 
the  probability  of  a  random  value  being  actually  greater  than  this  is 
0.025,  and  the  probability  of  its  being  less  than  the  negative  of  this  yard- 
stick value  is  also  0.025.  The  yardstick  length  would  thus  correspond 
to  the  23^  percent  point  of  the  F  distribution.* 

For  each  flight  the  number  of  degrees  of  freedom  for  the  differences 
among  the  range  coordinates  of  the  N  bombs  is  N  —  1;  this  is  equal  to 
the  number  of  degrees  of  freedom  of  the  deviations  of  these  N  numbers 
from  their  mean.  The  variance  estimates  for  sights  and  for  flights  are 
obtained  by  adding  the  sums  of  squares  of  deviations  from  the  respective 
flight  means  and  dividing  by  the  sum  of  the  numbers  of  degrees  of 
freedom.  Thus  the  number  of  degrees  of  freedom  in  the  variance  esti- 
mate for  a  sight  is  g(N  —  1),  while  for  a  lot  it  is  gh(N  —  1). 

For  the  specified  values  g  =  h  =  2,  N  =  4  the  numbers  of  degrees 
of  freedom  for  flights,  sights,  and  lots,  respectively,  are  therefore  3,  6, 
and  12.  The  critical  values  of  F  in  these  three  cases  are  15.439,  5.8197, 
and  3.2773.  On  the  logarithmic  scale  of  s2  at  the  left  of  Fig.  3.3  the 
heights  from  1  to  these  three  numbers  are  the  lengths  of  the  yardsticks. 
On  Fig.  3.4  the  lengths  are  the  same. 

3.4.  Comprehensive  Measures 

3.4.1.  Control  Limits  for  the  T's. — When  the  variance  estimates  for 
the  base  period  utilize  a  large  number  n  of  degrees  of  freedom,  such  as 
the  value  256  here  used,  the  distribution  of  T2  is  approximately  the  same 
as  the  x2  distribution  that  has  been  tabulated  and  widely  used  for  a 
variety  of  statistical  purposes,  f  This  distribution  depends  on  a  number 
of  degrees  of  freedom  /,  which  may  be  defined  in  the  following  manner. 
For  each  bomb  the  measure  of  discrepancy  from  the  target 

*  The  2}4  percent  points  of  F  are  not  given  by  Fisher  and  Yates  but  are  given  by 
Merrington  and  Thompson. 

t  For  tables  of  the  x2  distribution,  see  Fisher  and  Yates,  Statistical  Tables,  Table 
IV;  or  Thompson,  "Percentage  Points  of  the  %2  Distribution." 
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has  (approximately  for  large  n)  the  x2  distribution  with  2  degrees  of 
freedom,  provided  the  bomb  is  dropped  under  essentially  the  same  condi- 
tions as  the  bombs  in  the  base  period,  and  provided  the  probability  dis- 
tribution is  actually  centered  at  the  target  without  bias.  The  sum  of 
any  number  of  independent  values  of  x2  has  the  x2  distribution  with  a 
number  of  degrees  of  freedom  equal  to  the  sum  of  the  numbers  for  the 
several  values  of  x2  added.  Thus  T20FJ  the  sum  of  the  N  values  of  T20B  for 
the  N  bombs  dropped  in  a  flight,  has  approximately  the  x2  distribution 
with  2N  degrees  of  freedom.  Since  g  nights  are  made  with  each  sight, 
T%s  has  2gN  degrees  of  freedom,  and  since  h  flights  of  each  lot  are 
tested,  T%L  has  2ghN  degrees  of  freedom.  Since  in  the  specific  plan  of 
Sec.  2.1,  g  —  h  —  2  and  N  =  4,  control  limits  for  T%L  may  be  obtained 
approximately  from  the  0.01  and  0.05  probability  levels  for  x2  with 
32  degrees  of  freedom. 

Under  this  same  condition  of  absence  of  bias,  fF  \/N  and  dF  \/N 
have  the  same  joint  distribution  as  r  and  d.     Consequently 

T2MF  =  N(af2  +  2bfd  +•  cd2) 

has,  with  the  same  approximation  as  before,  the  x2  distribution  with  2 
degrees  of  freedom.  For  the  gh  flights  involved  in  testing  a  sample  lot, 
the  sum  T2ML  of  these  quantities  has  approximately  the  x2  distribution 
with  2gh  degrees  of  freedom.  For  the  plan  of  Sec.  2. 1  this  means  8  degrees 
of  freedom  for  T2ML. 

The  distribution  of  a  sum  of  squares  of  deviations  from  a  mean  is 
independent  of  the  distribution  of  the  mean  itself  when,  as  is  assumed 
throughout,  the  fundamental  distribution  is  of  the  normal  form.  In  like 
manner  the  sum  of  the  products  of  the  deviations  of  r  and  d  from  their 
respective  means  is  independent  of  these  means  in  the  probability  sense. 
It  follows  that 

T2DF  =  aS(r  -  fF)2  +  2bS(r  -  fF)(d  -  dF)  +  cS(d  -  dF)2 
is  independent  of  T2MF.     Since 


and  since  TMF  has  2  degrees  of  freedom  while  T0F  has  2N ,  it  follows  that 
T2DF  has  approximately  the  x2  distribution  with  2(iV  —  1)  degrees  of 
freedom.  Since  T%L  is  the  sum  of  gh  independent  values  of  T2DF,  the 
number  of  degrees  of  freedom  for  TDL  is  2gh(N  —  1),  a  number  which  for 
the  plan  of  Sec.  2.1  equals  24. 

It  is  assumed  in  the  foregoing  paragraphs  that  r  and  d  are  free  from 
bias.     If  there  is  a  bias,  the  same  in  the  new  as  in  the  old  bombings, 
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r  —  f'  and  d  —  d'  will  be  free  of  bias  but  will  have  variances  1  +  \/n' 
times  those  of  r  and  d,  respectively.  (Here  n'  is  the  number  of  bombs  in 
the  old  sample,  equal  to  260  for  the  base  period  considered.)  Hence,  if 
we  replace  r  and  d,  respectively,  by 


\  n 


and 


d-d' 


\  n 


in  T2OB,  the  resulting  statistic 


^2      _  a(r  -  f'Y  +  2b(r  -  f')(d  -  d')  +  c(d  -  d')2 

has  the  distribution  that  T20B  would  have  in  the  unbiased  case.  The  sum 
TlL  of  Ngh  values  of  T20B  therefore  has  the  same  approximate  x2  distribu- 
tion with  2Ngh  degrees  of  freedom  as  T%L. 

The  mean  range  fF  for  the  N  bombs  of  a  flight  differs  from  the  mean 
range  f  for  the  n'  bombs  of  the  base  period  by  an  amount  whose  variance 
is  l/N  +  1/n'  times  the  variance  of  range  in  general.  The  like  is  true 
of  the  difference  dF  —  d' .     Consequently 

^2      _  a(fF  -  f'Y  +  2b(fF  -  ?){dF  -  d')  +  c(dF  -  d'Y 

*  MF    ~  11 

N  +  n' 

has  the  same  approximate  x2  distribution  as  T%F  with  2gh  degrees  of 
freedom. 

These  approximate  distributions  require  correction  for  the  limited 
number  n  of  degrees  of  freedom  of  the  sample  in  the  base  period.  If  a 
base  period  with  a  value  of  n  less  than,  say,  100  were  taken,  the  correction 
would  be  substantial.  As  the  value  of  n  increases,  the  exact  x2  distribu- 
tion is  approached  as  a  limit. 

No  study  of  the  exact  distributions  of  the  various  T  statistics  here 
used  has  been  published  except  for  the  simplest  case.  The  probability 
that  T0B  should  exceed  any  assigned  value  T  is  known  to  be* 


This  is  the  special  case  for  p  —  2  of  the  distribution  of  the  generalized 
Student  ratio  given  by 

1       —    2j  Z/tijXiXj 

where  Uj  is  the  element  in  the  ith  row  and  jth  column  of  a  p-rowed  sym- 
metric matrix  inverse  to  the  matrix  of  sample  covariances  s#  between  Xi 
*  See  Hotelltng,  "Generalization  of  Student's  Ratio,"  formula  (26). 
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and  Xj  found  in  a  former  sample  with  n  degrees  of  freedom  for  each  of 
the  p  variates.  The  element  of  probability  for  this  generalized  Student 
ratio,  namely, 


2r . 

TP-idT 


(f)r(»_^±.)„„(1+?)<- 


+  D/2 


approaches  that  for  %  with  p  degrees  of  freedom  when  n  increases.  The 
probability  integral  of  this  distribution  may  be  evaluated  in  various  ways, 
one  of  which  is  to  use  the  asymptotic  expansion* 


X2  ~  T2 


1  _  P+  T2    .    4  ~  P2  +  (2  +  $V)T2  +  8714 


2n  24n' 


in  conjunction  with  the  x2  distribution.     From  it  may  be  obtained  the 
asymptotic  expansion  for  T2  in  terms  of  x2,  namely, 


^s^ 


2  h  4-  ^  +  ^2  4.  7p2  -  4  +  (13p  -  2)x2  +  4X4 
X         "^      2n      "^  24n2 


This  formula  is  particularly  suitable  for  determining  the  values  of  T2 
corresponding  to  any  particular  probability  level  from  those  already 
tabulated  for  x2- 

Probabilities  connected  with  the  T  distribution,  or  values  which  will 
be  exceeded  with  assigned  probabilities,  can  also  be  evaluated  by  means 
of  the  transformation 

npF 


rp2 


P+    1 


where  F  has  the  variance  ratio  distribution,  of  which  tables  are  available, 
with  rii  =  p  and  n2  =  n  —  p  +  1  degrees  of  freedom.  This  method  is 
particularly  suitable  for  small  values  of  n. 

This  known  T  distribution  may  be  used  temporarily  as  an  approxima- 
tion for  the  distributions  of  the  T  statistics  used  in  the  plan  presented  in 
this  chapter.  It  has  been  so  used  in  obtaining  the  control  limits  listed 
in  Sec.  2.2.6,  which  are  shown  in  Figs.  3.5,  3.6,  and  3.7.  For  this  approxi- 
mation the  p  of  the  foregoing  formulas  is  replaced  by  the  number  of 
degrees  of  freedom  /,  which  in  Sec.  2  has  the  values  8,  24,  and  32  for 
Tml,  Tdl,  and  TV,  respectively.  In  general,  /  equals  2gh,  2gh(N  —  1), 
and  2ghN  for  these  several  cases,  f 

*  See  Hotelling  and  Frankel,  "Transformation  of  Statistics,"  p.  96. 

f  An  article  on  the  mathematical  theory  of  this  chapter,  which  is  expected  to 
appear  in  the  Annals  of  Mathematical  Statistics,  will  include  the  exact  distribution  of 
these  7"s. 
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3.4.2.  Yardsticks  for  the  T's—  The  values  of  T2  obtained  from  differ- 
ent flights  are  not  exactly  independent  in  the  probability  sense,  since 
they  involve  the  same  values  of  a,  b,  and  c,  which  are  subject  to  the 
sampling  errors  of  the  base  period,  which  will  be  small  if  n  is  large. 
For  this  reason  it  appears  that  the  several  values  of  T2  will  have  positive 
correlations,  but  that  these  will  be  very  small  when  n  is  as  large  as  in  the 
sample  here  considered.  Apparently  no  great  error  will  be  made  by 
assuming  independence. 

Pending  the  development  of  a  more  exact  treatment,  we  may  obtain 
yardsticks  for  the  T  charts  by  assuming  independence  and  attributing  to 
each  T  the  x  distribution.  The  ratio  of  two  independent  values  of  T2, 
each  with  /  degrees  of  freedom,  has  thus  approximately  the  distribution 
of  the  ratio  of  two  independent  values  of  x2,  that  is,  F,  with  ni  =  n2  =  /. 
Since  yardsticks  are  desired  with  a  probability  0.05  of  being  exceeded 
in  length  on  the  logarithmic  scale,  the  tables  of  the  2}4  percent  points 
of  F  may  be  used.  The  values  of  /  for  single  flights  are  2N,  2,  and 
2(N  —  1)  for  To,  TM,  and  TD,  respectively.  These  values  of  /  are  to  be 
multiplied  by  g  for  sights  and  by  gh  for  lots. 

4.  BACKGROUND  AND  REASONS  FOR  THE  PLAN 
4.1.  Objectives  of  Sampling  Inspection 

Apart  from  special  experimental  and  exploratory  purposes,  two 
principal  objects  can  be  gained  efficiently  by  means  of  routine  inspection 
and  testing  of  small  samples,  and  a  third  object  can  at  the  same  time  be 
attained  in  a  somewhat  less  adequate  fashion.  From  small  samples 
it  is  possible 

(1)  To  judge  efficiently  the  general  level  of  quality  of  the  product  of 
a  factory  or  a  part  of  a  factory  over  a  period,  in  comparison  with  the 
output  of  other  productive  units,  and  of  the  same  unit  at  other  times. 

(2)  To  detect  promptly  irregularities,  trends  in  quality,  and  devia- 
tions from  customary  levels. 

(3)  To  select  and  reject  individual  bad  articles,  or  bad  lots.  This 
function  is  often  considered  the  primary  one.  However,  the  efficiency 
of  small  samples  for  this  purpose,  when  the  proportion  of  defectives 
that  can  be  tolerated  is  small,  is  not  so  satisfactory  as  is  sometimes 
imagined.  This  has  beeen  brought  out  in  a  recent  article,  "  Rational 
Sentencing,"  by  Dr.  E.  H.  Sealy. 

On  the  basis  of  the  small  samples  (5  to  10  per  lot  of  2,000)  which  it  is  possible 
to  fire  at  ammunition  proof,  it  is  quite  impossible  to  distinguish  between  the 
qualities  of  individual  lots,  or  to  sort  out  individual  bad  lots  from  the  bulk  in 
any  but  the  grossest  cases.  Proof  specifications  of  the  type  which  state  that 
a  lot  will  be  accepted  if  its  proof  is  clear  and  rejected  or  reproved  if  certain  defects 
appear  at  proof  have  very  little  effect  upon  the  quality  of  the  bulk  being  inspected. 
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This  has  often  been  demonstrated  in  cases  where  the  test  is  nondestructive  by 
100  percent  inspecting  the  bulk  both  before  and  after  proof.  The  percentages 
defective  found  at  100  percent  inspection  of  the  accepted  and  rejected  lots  are 
almost  invariably  found  to  be  nearly  equal  and  both  very  little  different  from  the 
percentage  of  defectives  in  the  original  bulk  presented  for  proof.  In  other  words, 
the  quality  of  the  product  passed  into  the  service  under  such  a  sentencing  system 
is  practically  that  which  happens  to  be  made  at  the  factory.  .  .  . 

It  is,  then,  quite  impracticable  to  attempt  to  discriminate  between  individual 
lots  by  means  of  a  small  sample  proof.  What  we  can  get,  however,  from  such  a 
proof  is  a  knowledge  of  the  average,  over-all  quality  of  the  product  of  any  given 
factory,  for  the  accumulation  of  many  small  sample  proofs  amounts  in  time  to 
bulk  of  evidence  which  there  is  no  gainsaying.  .  .  . 

Broadly  speaking,  then,  the  object  of  rational  sentencing  is  not  to  sort  out 
individual  good  and  bad  lots  but  to  estimate  qualities  assumed  to  be  reasonably 
constant.  If  we  estimate  that,  over  a  long  period,  a  firm  produces  consistently 
a  product  no  better  than  3  percent  defective,  then  if  we  need  this  firm's  produc- 
tion, we  must  perforce  accept  it  as  a  whole  along  with  its  3  per  cent  of  defectives, 
for  this  is  the  best  this  firm  can  do  under  its  present  state  of  knowledge  and  equip- 
ment. In  such  a  case  there  will  be,  on  the  average,  about  60  defectives  in  each 
lot  of  2,000,  and  clearly,  by  chance,  when  picking  a  sample  at  random  from  a  lot, 
one  or  more  of  these  60  defectives  will  from  time  to  time  be  found.  But  it  is 
irrational  to  suppose  that  the  lots,  none  of  whose  60  defectives  chanced  to  be 
chosen  in  the  proof  sample,  are  any  better  than  the  others. 

Dr.  Sealy  has  elsewhere*  shown  the  great  difficulty  of  devising  a 
practical  small-sample  scheme  that  Avill  reduce  to  reasonable  dimen- 
sions both  the  producer's  risk  of  having  good  lots  rejected  and  the 
consumer's  risk  of  having  bad  lots  accepted,  when  the  definition  of  good- 
ness is  that  the  proportion  defective  in  a  lot  shall  be  not  more  than  about 
2  percent.  His  work  is,  however,  confined  to  cases  in  which  the  result 
of  the  inspection  of  each  article  is  merely  to  classify  it  as  defective  or 
passable.  The  crudeness  of  this  type  of  inspection  in  comparison  with 
quantitative  measurements  is  partly  responsible  for  the  difficulties  he 
emphasizes.  With  the  relatively  refined  statistical  methods  recom- 
mended in  this  chapter  for  quantitative  measures  of  bombsight  quality, 
the  chances  of  detecting  a  bad  lot  are  much  greater  than  by  the  simple 
procedures  whose  limitations  he  points  out.  The  fact  remains,  however, 
that  too  much  should  not  be  expected  of  small  samples  from  the  stand- 
point of  catching  infrequent  defects. 

One  way  to  meet  this  difficulty  is  to  increase  the  size  of  the  sample 
by  lumping  together  successive  lots,  while  keeping  the  same  proportion 
of  inspection,  so  as  to  have  larger  samples.  This  procedure,  however, 
introduces  new  and  formidable  difficulties.  It  is  objectionable  to  keep 
lots  of  thousands  on  hand  for  the  long  periods  required  for  selection  and 

*  "Acceptance  Sampling." 
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testing  of  samples;  and,  more  fundamentally,  production  conditions  are 
likely  to  change  considerably  during  the  accumulation  and  testing  proc- 
esses for  such  samples,  so  that  the  homogeneity  necessary  for  accurate 
inferences  is  lost. 

4.2.  Reasons  for  Random  Selection  and  for  Lot  Size  20 

It  is  evident  that  the  detection  of  individual  defective  bombsights, 
or  small  lots  of  bombsights,  needs  to  be  done  primarily  by  tests  in  the  shop 
rather  than  in  the  air.  The  main  reliance  for  this  purpose  must  be  on  the 
painstaking,  detailed  inspection  of  every  sight  made  by  the  government 
inspectors  at  the  factory.  The  inspection  at  the  proving  ground  of  10 
percent  of  the  sights  can  be  only  a  partial  check  on  the  inspectors  at 
the  factories,  incapable  of  reliability  in  detection  of  bad  small  lots  or 
individual  sights  in  any  but  the  grossest  cases. 

The  real  functions  of  testing  and  inspection  at  the  proving  ground 
must  be  taken  primarily  to  be  the  two  set  forth  at  the  beginning  of  the 
preceding  section,  namely,  (1)  the  continuing  appraisal  of  general  level 
of  quality,  and  (2)  the  detection  of  trends,  excessive  irregularity,  and 
differences  between  sights  produced  at  about  the  same  time  in  the  same 
factory. 

To  carry  out  these  functions  it  is  essential  to  have  at  the  proving 
ground  sights  chosen  at  random.  Selection  of  those  apparently  the 
worst,  as  has  been  done  in  the  past,  introduces  not  only  a  bias,  which 
could  be  compensated  by  calculation  after  the  accumulation  of  experience 
with  sufficient  numbers  chosen  at  random,  but  the  much  more  serious 
difficulty  of  excessive  variation.  To  carry  out  accurately  the  two 
leading  functions  of  sampling  inspection  on  the  basis  of  samples  thus 
selected  is  practically  impossible.  The  worst,  or  the  least,  or  the  greatest, 
with  respect  to  any  characteristic,  in  samples  of  10  or  more  has  a  distribu- 
tion far  removed  from  that  of  the  general  population  from  which  the 
samples  were  drawn,  not  only  in  average  level,  but  also  in  variability 
from  sample  to  sample  and  in  the  form  of  the  distribution.  It  is  these 
considerations  that  lead  to  the  recommendation  to  select  10  percent  of 
the  output  at  random,  in  place  of  taking  the  worst  1  in  10.  ■ 

We  do  not  recommend  taking  1  in  each  successive  lot  of  10  at  random, 
but  rather  taking  2  at  random  from  each  of  successive  lots  of  20.  This 
might  be  replaced  by  taking  3  at  random  from  each  of  successive  lots  of 
30,  or  similarly  with  large  numbers.  However,  the  use  of  these  larger 
lots  would  mean  delay  while  they  were  being  accumulated  and  would 
increase  the  trouble  with  fractions  of  lots  remaining  as  tag  ends  with  every 
change  of  model.  An  even  more  cogent  objection  to  large  lots  is  that 
the  larger  the  lot  the  greater  is  the  chance  of  some  change  in  the  condi- 
tions of  production  while  it  is  being  accumulated.     Such  a  change  means 
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heterogeneity  within  the  lot,  which  weakens  the  inferences  from  it,  and 
means  that  the  existence  of  the  change  is  not  likely  to  be  detected  so 
soon  as  if  the  lots  were  smaller.  It  is  particularly  important  that 
deteriorations  in  quality  be  discovered  promptly,  in  order  that  further 
defective  sights  may  be  prevented  before  a  great  number  of  them  accumu- 
lates. These  considerations  lead  to  a  choice  of  size  of  lot  as  small  as  is 
consistent  with  having  a  plurality  of  individual  sights  chosen  at  random 
from  it. 

The  use  of  10  percent  as  the  proportion  of  sights  to  be  sent  to  the 
proving  ground  is  neither  defended  nor  opposed  here  but  is  adopted  as  a 
basis  of  calculation  because  this  proportion  has  been  used  in  the  past. 

4.3.  Relation  to  Experimental  Design 

One  reason  for  using  at  least  two  sights  chosen  at  random  from 
each  lot  is  in  order  to  obtain  a  ready  measure  of  variation  within  the  lot, 
so  that  excessive  differences  can  be  observed  with  the  help  of  the  yard- 
sticks provided  for  the  purpose  and  attention  can  thus  be  directed  to 
possible  heterogeneity  in  the  productive  process  that  might  lead  to 
deterioration.  Another  reason  is  the  desirability  of  averaging  together 
results  from  different  sights  of  a  lot  in  order  to  eliminate  some  of  the 
chance  variation  and  make  trends  and  excessive  errors  more  clearly 
discernible. 

These  two  reasons  are  features  of  the  modern  theory  of  the  design 
of  experiments,  which  rests  on  the  twin  principles  of  replication  and 
randomization.*  Replication  (that  is,  repetition)  has  two  values.  One 
of  these  is  the  increased  accuracy  obtainable  by  averaging  the  results  of 
several  observations.  The  other,  which  is  of  the  greater  importance 
though  perhaps  less  obvious,  is  the  provision  of  data  for  estimating  the 
variance  due  to  errors  of  observation,  data  consisting  essentially  of 
the  differences  between  the  results  obtained  from  experiments  in  which 
the  controlled  conditions  are  the  same.  Knowledge  concerning  this 
error  variance  is  necessary  before  any  conclusion  can  be  drawn,  since 
real  effects  can  only  be  distinguished  from  chance  when  information  is 
available  as  to  the  distribution  of  the  chance  factor.  It  is  true  that 
estimates  of  chance  variance  can  sometimes  be  drawn  from  other  analo- 
gous experiments  or  from  general  knowledge;  but  such  estimates  are  of 
doubtful  validity  when  applied  to  a  new  set  of  data  that  must  inevitably 
differ  somewhat  in  origin  from  those  provided  by  older  experiments. 
The  most  satisfactory  estimate  of  variance  is  that  provided  by  the  self- 
contained  experiment,  which  is  arranged  from  the  beginning  so  as  to 

*  See  Fisher,  Design  of  Experiments,  'especially  Sees.  9  and  10  of  Chap.  2,  Sec.  20 
of  Chap.  3,  and  Sees.  25  and  26  of  Chap.  4. 
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yield  such  an  estimate  at  the  same  time  as  the  measures  of  the  main 
effects  to  be  investigated. 

Randomization  is  necessary  in  order  to  make  the  results  valid,  in 
the  sense  that  the  calculated  estimates  are  free  from  bias.  When  the 
selection  of  cases  for  particular  kinds  of  treatment  is  made  by  any  human 
being,  and  especially  by  a  person  familiar  with  the  things  selected,  there 
is  grave  danger  of  bias,  conscious  or  unconscious,  in  the  assignments. 
Zeal  and  conscientious  care  in  the  selection  are  no  safeguards  against 
bias,  as  has  been  demonstrated  repeatedly  in  a  variety  of  fields.  The 
principal  reason  for  this  is  that  attempts  to  get  a  " representative" 
sample  or  set  of  conditions  by  conscious  choice  usually  result  in  excessive 
variance  within  the  group  in  a  way  that  cannot  be  taken  account  of  by 
any  possible  calculation  when  the  probabilities  necessary  for  testing  the 
significance  of  the  results  are  sought.  Such  difficulties  can  be  overcome 
only  through  the  actual  utilization  of  the  mechanisms  of  games  of  chance 
or  their  equivalents.  The  general  framework  of  the  experiment  should 
be  established  by  careful  design;  but  within  this  framework  the  allocation 
of  the  particular  treatments  to  individual  cases  must  be  strictly  at 
random. 

When  applied  to  the  sampling  inspection  of  articles  coming  suc- 
cessively from  a  production  line,  the  principles  of  replication  and  randomi- 
zation suggest  the  following  procedure:  If  10  percent,  for  example,  of 
the  product  is  to  be  subjected  to  a  specially  intensive  testing,  with  a 
fixed  number  chosen  from  each  successive  batch  of  some  chosen  size, 
these  principles  call  for  the  selection  at  random,  within  each  batch,  of  at 
least  two  articles.  Thus,  samples  of  10  per  cent  taken  in  accordance 
with  these  principles  require  batches  of  at  least  20  articles. 

A  seemingly  more  natural  and  perhaps  a  more  usual  practice  when  a 
10  percent  sample  is  to  be  taken  from  a  production  line  is  to  take  every 
tenth  article.  If  there  should  be  a  cycle  of  period  10  in  any  part  of  the 
productive  process,  this  practice  would  introduce  a  biased  error  into  the 
averages.  It  also  fails  to  provide  a  continuing  estimation  of  variance  and 
therefore  encourages  a  reliance  on  the  variance  obtained  from  data 
available  when  the  system  was  inaugurated,  observations  which  may 
in  some  cases  be  unfortunately  few  in  number,  and  which  always  refer 
to  a  past  time  and  are  therefore  more  or  less  obsolescent. 

4.4.  Efficient  Formulas  for  Utilizing  Results 

The  formulas  given  in  preceding  parts  of  this  chapter  have  properties 
of  maximum  efficiency  and  accuracy  of  interpretation,  provided  that  the 
actual  distribution  of  bomb  coordinates  under  homogeneous  conditions 
is  of  the  bivariate  normal  form.  This  form  of  distribution  is  strongly 
suggested  by  general  mathematical  theory  based  on  the  idea  of  the  errors 
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as  resultants  of  a  large  number  of  independent  causes.  It  is  approxi- 
mated by  various  records  of  actual  bombing,  the  principal  deviation 
being  a  slight  skewness  of  the  range  errors  due  to  a  small  proportion  of 
defective  bombs  that  tumble  and  fall  short.  For  distributions  having 
moderate  deviation  from  normality  the  statistical  efficiency  of  the 
methods  based  on  the  assumption  of  normality  is  still  nearly  100  percent. 

For  the  normal  distribution  in  one  or  more  variates,  it  is  known 
that  all  the  information  supplied  by  any  random  sample  with  respect 
to  the  unknown  parameters  of  the  distribution  is  contained  in  the  sample 
means  and  mean  squares  of  the  individual  variates,  and  in  the  mean 
products  of  pairs  of  variates.  Only  these  functions  of  the  observations, 
and  functions  of  these  functions,  are  capable  of  full  efficiency  in  estimation 
of  the  unknowns.  If  any  other  function  of  the  observations  is  used  for 
estimation  or  for  the  testing  of  any  hypothesis  regarding  the  unknown 
parameters,  there  is  a  reduction  in  accuracy.  If  the  use  of  an  inefficient 
statistical  method  is  insisted  upon,  the  loss  of  accuracy  can  be  made  good 
by  increasing  the  number  of  observations  in  a  proportion  inverse  to  the 
efficiency  of  the  method  in  question. 

These  general  principles  are  illustrated  by  the  use  of  the  mean 
deviation,  or  mean  of  absolute  values  of  errors,  as  a  measure  of  dispersion, 
by  comparison  with  the  standard  deviation,  or  by  the  median  in  com- 
parison with  the  mean.  If  a  variate  x,  for  example  the  deflection  error 
of  bombs  dropped  under  certain  conditions,  is  normally  distributed,  this 
means  that  the  element  of  probability  is 

1  (*-s)2 

2*2    dx 


that  is,  the  integral  of  this  expression  between  any  two  limits  is  the 
probability  of  x  falling  between  these  limits.  This  probability  distribu- 
tion involves  two  parameters,  namely,  the  mean  value  £  and  the  standard 
deviation  a,  measuring  respectively,  in  the  example  of  deflection  errors, 
the  deflection  component  of  the  mean  point  of  impact  of  all  bombs 
dropped  under  the  specified  conditions,  and  dispersion  about  this  mean 
point  of  impact.  The  square  of  the  standard  deviation  is  known  as  the 
"variance";  this  parameter  o-2  is  the  mean  value  of  the  squares  of  devia- 
tions from  the  mean. 

When  we  have  N  observations  Xi,  x2,  .  .  .  ,  xN  of  the  variate  x,  we 
may  calculate  from  them  various  functions,  each  known  as  a  "statistic," 
designed  to  estimate  the  unknown  parameters  £  and  a.  Thus  £  may  be 
estimated  by  the  arithmetic  mean 

-  -  Xl  it"   '  '  I   +  Xn 
x  ~  N 
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or  alternatively  by  the  median  m,  which  is  denned  as  the  middle  observa- 
tion (when  the  number  N  is  odd)  when  the  observations  are  arranged  in 
order  of  size.  A  set  of  N  independent  observations  is  called  a  " sample." 
If  from  many  samples  of  N  we  form  the  means  x,  these  will  have  a  dis- 
tribution of  their  own,  which  happens  to  be  normal  also.  The  mean  of 
this  new  normal  distribution  has  the  same  value  £  as  the  old.  But  instead 
of  the  variance  a2,  the  distribution  of  means  has  for  its  variance  the 
reduced  value  a2/N.  This  reduction  in  variance  measures  the  greater 
accuracy  in  averages  by  comparison  with  single  observations. 

The  distribution  of  medians  from  samples  of  N  is  not  of  the  normal 
form  but  approximates  it  if  N  is  large.  The  variance  of  this  distribution 
of  medians  is  less  than  that  of  x  but  greater  than  that  of  the  means  x. 
The  variance  of  medians  is  in  fact  approximately  ir/2,  or  1.5708,  times  the 
variance  of  means  from  samples  of  the  same  size  when  N  is  large.  The 
inefficiency  of  the  median  is  expressed  by  this  greater  dispersion,  or  by 
considering  the  relative  sample  numbers  required  to  obtain  the  same 
accuracy  by  the  median  as  by  the  mean.  Taking  the  latter  point  of  view, 
and  letting  N0  be  the  sample  size  to  be  used  when  the  arithmetic  mean  is 
taken  as  the  estimate  of  £,  while  JVi  observations  (for  example,  bombs) 
are  to  be  used  when  the  median  is  taken  instead  of  the  mean,  we  have  for 
the  two  variances 

2    _     (J2  2        _     7T0T2 

If  the  two  variances  are  to  be  equal,  the  ratio  of  the  sample  numbers  is 

^  =  ?  =  0.6366 

Consequently,  the  efficiency  of  the  median  is  said  to  be  63.66  percent. 
The  efficiency  of  the  mean  has  been  demonstrated  to  be  100  percent; 
that  is,  its  variance  has  the  smallest  possible  value. 

The  situation  is  similar  with  respect  to  estimates  of  variance.  There 
are  infinitely  many  statistics  whose  mean  values  are  equal  to  a2.  Among 
all  these,  the  particular  statistic 

,       S{x  -  x)2 


N  -  1 


(where  S  stands  for  summation  over  the  sample)  enjoys  the  property 
that  its  distribution  has  a  smaller  variance  than  any  of  the  others.  In 
particular,  when  the  mean  error  (that  is,  the  arithmetic  mean  of  the 
absolute  values  of  the  deviations  from  the  mean)  is  multiplied  by  a  factor 
such  that  the  mean  value  of  the  resulting  statistic  is  <r,  this  statistic  has  a 
distinctly  wider  dispersion  than  s  when  the  sample  size  is  the  same.     Both 
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these  statistics  have  distributions  approaching  normality  of  form  when 
N  is  large,  with  the  efficiency  ratio 

— i-s  -  0.87597 
x  —  Z 

The  wide  current  use  of  the  mean  deviation  thus  has  an  efficiency  of  only 
about  87.6  percent.  If  bombs  are  dropped  for  the  purpose  of  ascertaining 
their  dispersion,  the  use  of  the  mean  deviation  is  equivalent  to  wasting 
about  one  bomb  in  eight. 

Against  this  waste  of  bombs  may  be  set  the  fact  that  the  mean  devia- 
tion is  easier  to  calculate  than  the  standard  deviation  s.  The  labor  of 
calculating  s  is,  however,  much  less  than  is  sometimes  supposed  and  is 
trivial  if  a  calculating  machine  of  any  kind,  or  even  a  table  of  squares, 
is  at  hand.     The  identity 

S(x  —  x)2  =  Sx2  —  xSx 

means  that  the  individual  deviations  from  the  mean  need  not  be  com- 
puted. In  view  of  the  high  cost  of  aerial  bombing,  it  appears  definite 
that  the  use  of  mean  deviations,  and  of  functions  of  them,  ought  to  be 
eliminated.  * 

4.5.  Exact  Probabilities 

The  standard  deviation  s  defined  above  has  another  advantage  over 
the  mean  deviation  and  all  other  methods  of  estimating  a  in  that  the 
exact  distribution  of  s  for  samples  of  every  size,  even  the  smallest,  is  a 
known  elementary  mathematical  function,  and  tables  of  its  probability 
integral  are  available.  It  also  has  numerous  other  practical  advantages 
in  more  complicated  situations,  such  as  those  involving  least  squares 
or  the  analysis  of  variance.  In  view  of  the  small  samples  that  must 
sometimes  be  used,  the  property  of  having  readily  determinable  exact 
probabilities  is  a  valuable  one.  Asymptotic  formulas  are  somewhat 
dubious  in  application  to  particular  cases  unless  a  precise  limit  of  error 
is  at  hand  and  are  thus  inferior  to  exact  probabilities. 

The  choice  of  suitable  statistics  is  also  influenced  by  another  and 
more  subtle  criterion  of  accurately  determinable  probabilities.  This 
is  the  desideratum  that  in  testing  any  hypothesis  we  should  use  probabili- 
ties that  depend  on  nothing  but  the  observations,  together  with  any 
numbers  that  may  be  specified  in  the  hypothesis.  Statisticians  have 
often,  through  lack  of  existence  or  knowledge  of  better  methods,  used 
formulas  for  standard  errors  and  other  measures  of  accuracy  in  which 
were  involved  unknown  parameters.     The  practice  has  been  to  substitute 

*  See  Fisher,  "Mean  Error  and  Mean  Square  Error."  For  a  discussion  of  the 
general  theory  of  the  efficiency  of  statistics,  see  Fisher,  "Theory  of  Statistical  Estima- 
tion," and  "Statistical  Theory  of  Estimation," 
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for  these  parameters  estimates  of  them  obtained  from  the  sample.  This 
substitution  however  entails  an  additional  sampling  error  not  under  con- 
trol. Modern  statistical  theory  has  succeeded  in  devising  various  ways  of 
overcoming  this  difficulty.  The  statistical  methods  recommended  in 
this  chapter  all  fulfill  the  criterion  that  the  associated  probabilities  are 
exact,  free  from  unknown  nuisance  parameters.  This  is  true  in  particular 
of  the  measures  T0,  TM,  and  TD  combining  range  with  deflection  errors. 
If  the  new  observations  are  from  the  same  population  as  the  old  and  if 
this  has  a  normal  distribution,  then  the  distribution  of  each  of  these  T's 
is  absolutely  independent  of  all  unknown  parameters.  Only  the  sample 
numbers  are  involved,  and  these  are  of  course  known.  To  pass  from  the 
observations  to  the  relevant  exact  probabilities  through  these  T's  is  a 
purely  mathematical  operation,  involving  no  element  of  judgment.  It 
is  true  that  the  available  tables  do  not  cover  fully  the  T  distribution, 
so  that  asymptotic  approximations  must  be  used  for  the  present,  but  this 
is  a  defect  that  can  be  remedied  with  only  a  little  more  mathematical 
research  and  calculation.  The  probabilities  respecting  TM  can  be  found 
easily  and  accurately  from  existing  published  tables  when  only  one  flight 
is  involved,  but  the  combination  of  these  values  for  different  flights  leads 
to  a  more  general  distribution,*  which  is  also  applicable  to  TD  and  T0. 

4.6.  Reasons  for  the   T  Method  of  Combining  Range  and  Deflection 
Errors 

The  particular  method  of  combining  range  with  deflection  errors 
represented  by  the  T  formulas  may  be  questioned.  To  many  it  will  seem 
more  natural  to  use  measures,  apparently  simpler,  based  on  the  simple 
distances  from  the  bombs  to  the  target.  After  all,  it  will  be  said,  the 
ultimate  value  of  a  bomb  aimed  at  the  enemy  depends  only  on  the  simple 
geometrical  distance  from  its  point  of  impact  to  the  target,  and  not  on  any 
figurative  measure  of  distance  that  is  constant  along  ellipses  instead  of 
circles.  The  simple  distance  has  a  clear  meaning,  intelligible  to  everyone. 
It  can  be  measured  on  a  map  without  reading  off  coordinates.  Why  not, 
then,  use  a  criterion  of  excellence  of  bombing  depending,  in  an  inverse 
sense,  only  on  the  distances  of  the  bombs  from  the  target? 

The  several  distances  might  be  combined  by  simple  averaging,  or  by 
taking  the  mean  square,  or  in  numerous  other  ways.  Use  of  the  mean 
square,  or  the  sum  of  the  squares,  of  the  distances,  has  a  special  attrac- 
tion in  that  the  sum  of  the  squares  of  the  distances  of  the  bombs  from 
the  target  equals  the  sum  of  the  squares  of  their  distances  from  the  mean 
point  of  impact,  plus  the  product  of  the  number  of  bombs  by  the  square 
of  the  distance  from  the  mean  point  of  impact  to  the  target.     This 

*  This  mathematical  gap  has  been  filled  in  since  the  above  text  was  written.  The 
results  are  expected  to  appear  in  the  Annals  of  Mathematical  Statistics. 
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relation,  which  is  analogous  to 

n  =  n  +  n 

makes  it  possible  to  examine  separately  two  components  of  the  total  sum 
of  squares  that  may  be  due  to  quite  different  causes.  Furthermore, 
estimates  of  the  circular  probable  error  and  similar  parameters  have 
greater  statistical  efficiency,  that  is,  smaller  standard  errors  for  a  given 
number  of  observations,  if  based  on  the  sum  of  squares  instead  of  the 
simple  sum  or  mean  of  the  distances. 

But  the  efficiency  even  of  the  mean  square  circular  error  for  estimating 
the  general  degree  of  dispersion  is  less  than  that  of  T0  or  TD,  provided 
the  base  period  sample  is  representative  of  the  true  distribution  as 
regards  the  ratios  of  the  variances  of  range  and  deflection,  and  as  regards 
the  correlation.  If  the  true  mean  point  of  impact  is  unknown  or  if  we 
choose  to  ignore  it,  the  best  available  measure  of  general  degree  of  dis- 
persion is  TD  or  a  function  of  TD.  If  the  true  mean  point  of  impact  could 
be  assumed  to  be  at  the  target,  the  best  measure  of  general  degree  of 
dispersion  would  be  To  or  a  function  of  T0.  Such  measures  have  smaller 
variances  than  measures  of  the  same  parameter  based  on  the  mean  square 
circular  error  in  a  ratio  which,  if  we  can  trust  the  experience  described 
in  Sec.  5,  is  slightly  less  than  82  percent.  This  implies  that  the  estima- 
tion of  dispersion  can  be  carried  out  with  about  equal  accuracy  with  82 
bombs  if  TD  or  To  is  used,  or  with  100  bombs  if  the  mean  square  circular 
error  is  used.  The  simple  mean  of  distances  from  the  target  or  from  the 
MPI,  or  any  estimate  involving  mean  deviations,  would  require  distinctly 
more  than  100  bombs  to  gain  the  same  accuracy  obtainable  with  82 
bombs  when  T  is  used. 

The  seemingly  paradoxical  result  emerges  that,  to  obtain  the  best 
possible  estimate  of  the  mean  square  deviation  from  the  target  to  be 
expected  in  later  bombing  under  like  conditions,  the  mean  square  devia- 
tion from  the  target  in  the  observed  cases  should  not  be  used.  Likewise, 
to  estimate  the  mean  deviation  from  the  target  as  accurately  as  possible 
for  the  future,  the  mean  deviation  from  the  target  in  the  past  should  not 
be  used.  The  same  is  true  of  shooting  and  of  all  other  situations  in 
which  the  joint  distribution  of  two  or  more  variates  does  not  have 
circular  or  spherical  symmetry.  Indeed,  for  any  given  set  of  ratios 
among  the  variances  and  covariances,  the  mean  square  deviation  from 
the  target  and  the  mean  deviation  from  the  target  for  the  population  of 
bombs  or  shots,  from  which  our  observations  constitute  a  sample,  are 
definite  multiples  of  a  parameter  measuring  general  degree  of  dispersion ; 
and  such  a  parameter  is  best  estimated  by  a  quadratic  expression  of  the 
T2  type. 

For  detection  of  bias  in  the  mean  point  of  impact,  the  use  of  Tm 
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has  advantages  over  all  other  methods  that  have  been  studied.*  It  is 
true  that  when  only  a  bias  in  a  specific  direction,  such  as  that  of  range 
errors,  is  to  be  considered  and  it  is  certain  that  there  is  no  bias  in  any  other 
direction,  the  mean  of  the  components  of  deviation  (positive  and  negative) 
in  the  particular  direction  is  the  most  efficient  measure,  while  the  use  of  a 
measure  of  bias  in  all  directions  weakens  somewhat  the  test  for  a  bias 
in  the  particular  direction.  This  difficulty  is,  however,  inescapable 
when  there  is  any  real  importance  and  likelihood  of  errors  of  bias  in  more 
than  one  direction.  If  a  rule  were  established  that  a  bomb  should  be 
classed  as  "bad"  if  its  range  error  were  such  as  to  be  exceeded  with 
probability  0.01  and  it  should  also  be  classified  as  bad  if  the  deflection 
error  were  such  as  to  be  exceeded  with  probability  0.01,  there  would  be 
some  danger  of  misunderstanding.  Many  persons  would  suppose  that 
the  probability  of  a  good  bomb  being  wrongly  classified  as  bad  under 
such  circumstances  is  0.01,  whereas  actually  this  probability  is  in  the 
neighborhood  of  0.02.  Such  erroneous  ideas  abound  in  statistical 
practice  in  many  fields.  Use  of  the  combined  measure  TM  avoids  this 
difficulty. 

The  T  statistics  are  algebraic  invariants  and  as  such  possess  certain 
desirable  properties.  Any  transformation  of  coordinates  such  as  a 
rotation  of  axes  or  a  change  in  units  of  measure,  if  applied  both  to  the 
old  and  to  the  new  observations,  leaves  each  of  the  T's  invariant  and 
therefore  does  not  affect  the  probabilities  involved.  This  same  property 
of  invariance  is  also  important  in  obtaining  the  relevant  probability 
formulas.  It  implies  the  very  important  fact  that  the  probabilities 
associated  with  the  T's  are  exact  numbers,  not  functions  of  nuisance 
parameters  whose  values  are  never  exactly  known. 

There  may  be  a  thought  that  instead  of  T  another  invariant  known 
as  the  "generalized  variance"  might  be  used.f  The  generalized  variance 
has  many  important  potential  uses.  However,  it  is  not  so  suitable  as 
T  for  the  purposes  here  in  view  because,  in  two  dimensions,  it  is  essentially 
a  measure  of  distance  (in  a  generalized  sense)  from  a  line  instead  of  from 
a  point.  While  TM  and  To  can  be  zero  only  if  all  the  bombs  fall  on  the 
target,  and  TD  can  be  zero  only  if  all  the  bombs  fall  on  one  point,  the 
generalized  variance  will  be  zero  if  all  the  bombs  fall  on  any  one  line,  no 
matter  how  they  are  scattered  along  it.  This  consideration  also  elimi- 
nates for  the  present  purpose  several  other  statistics  obtained  by  maxi- 
mum likelihood  methods.  J 

An  objection  likely  to  be  made  to  the  T  formulas  is  against  the  inclu- 
sion of  the  term  in  the  product  of  range  by  deflection  errors.     If  the 

*  See  Hsu,  "Generalized  T";  Wald,  "Tests  Concerning  Several  Parameters." 
f  Wilks,  "Generalizations  in  the  Analysis  of  Variance." 
J  Pearson  and  Wilks,  "k  Samples  of  Two  Variables." 
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correlation  between  range  and  deflection  in  the  sample  in  the  base  period 
were  exactly  zero,  this  term  would  be  zero.  It  may  be  argued  on  a  priori 
grounds  that  the  true  correlation  is  zero,  and  that  when,  as  in  the  sample 
here  used,  the  observed  correlation  is  only  slightly  greater  than  its  stand- 
ard error  (Sec.  5  gives,  after  adjustments,  a  correlation  of  —0.07535,  which 
is  1.2  times  its  standard  error),  later  work  should  proceed  on  the  fixed 
assumption  that  the  correlation  is  zero.  A  further  extension  of  this  idea 
might  lead  to  ignoring  the  difference  between  the  variances  of  range  and 
deflection,  and  consequently  using  the  mean  square  deviation,  whose 
inefficiency  has  been  described  above. 

While  omission  of  the  small  product  term  from  T2  would  seem  at 
first  to  save  a  little  computational  labor,  it  would  lead  to  great  mathe- 
matical complexities  in  connection  with  the  probabilities.  The  resulting 
function  is  not  an  invariant,  and  its  distribution  involves  as  a  nuisance 
parameter  the  ratio  of  range  variance  to  deflection  variance  in  the  popula- 
tion, which  is  estimated  by  the  sample  only  with  an  uncertainty  that 
makes  highly  dubious  any  probabilities  calculated  on  this  basis.  More- 
over, if  it  should  turn  out  that  there  really  is  some  correlation  between 
range  and  deflection  errors,  whether  the  causes  and  magnitude  of  such 
correlation  are  known  or  unknown,  this  correlation  would  constitute  an 
additional  nuisance  parameter  having  serious  effects  on  the  accuracy 
of  judgments  based  on  the  observations.  In  this  connection  it  should 
be  recalled  that,  while  the  particular  sample  here  considered  does  not 
establish  existence  of  a  correlation,  other  reports  seem,  at  least  super- 
ficially, to  show  high  and  significant  correlations  between  range  and 
deflection  errors  in  practice  bombing.  Some  officers  are  of  the  opinion 
that  a  real  correlation  exists. 

Another  way  of  avoiding  the  calculation  of  products  of  range  by 
deflection  errors  would  be  to  make  separate  tests  of  significance  for  range 
and  deflection,  using  the  variance  ratio  distribution,  and  then  to  combine 
these  by  the  method  of  combining  independent  test  probabilities  pro- 
pounded by  Fisher  and  studied  by  Wallis  and  others.*  This  method 
requires  use  of  the  exact  probabilities  rather  than  of  the  values  of  the 
variance  ratio  at  a  few  points  such  as  those  for  probabilities  0.01  and 
0.05.  Consequently,  the  tables  of  the  F  distribution  must  be  used  twice 
for  every  such  test  and  thus  for  every  point  plotted  on  control  charts 
corresponding  to  those  for  TV,  TD,  and  T0.  The  labor  involved  would 
amount  to  far  more,  and  the  assumptions  necessary  for  validity  are  more 
stringent,  by  this  than  by  the  T  methods. 

The  methods  based  on  T  are  valid  and  accurate  whether  a  true  cor- 
relation exists  or  not,  and  whatever  the  ratio  of  the  true  variances  may  be. 

*  Fisher,  Statistical  Methods,  Sec.  21.1  of  Chap.  4;  Wallis,  " Compounding 
Probabilities." 
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The  only  limitations  on  this  statement  are  in  the  assumptions  of  normality 
and  independence,  which  are  made  throughout,  not  only  in  the  present 
discussion  but  in  virtually  the  whole  literature  of  bombing  and  firing, 
including  that  concerned  with  methods  of  combining  observations. 
These  assumptions,  in  relation  to  test  bombing,  are  discussed  in  Sec. 
3.1.4,  pp.  146-147. 

5.  STATISTICAL  ANALYSIS  OF  CERTAIN  TEST  BOMBINGS 

5.1.  Summary  of  the  Data 

In  order  to  obtain  variances  in  range  and  deflection,  and  a  correlation, 
from  which  the  coefficients  a,  b,  c  in  T2  could  be  found,  a  series  of  65 
flights  was  analyzed.  These  consisted  of  all  those  flights  at  a  certain 
proving  ground  betAveen  November  4,  1942,  and  December  23,  1942, 
during  which  exactly  four  bombs  were  dropped.  The  coordinates 
of  the  point  of  impact  of  each  bomb  with  respect  to  axes  through  the 
target,  of  which  one  is  an  average  track  for  the  whole  flight,  were  supplied 
by  the  proving  ground,  together  with  the  heading  of  the  airplane  when 
the  bomb  was  dropped  and  other  data.  Two  of  the  bombs  on  each 
flight  were  dropped  while  the  airplane  was  moving  in  a  generally  northerly 
direction  and  the  other  two  while  it  was  flying  in  a  southerly  direction. 
The  actual  headings,  however,  varied  considerably,  the  difference  between 
the  two  in  the  same  general  direction  on  the  same  flight  being  commonly 
several  degrees  and  in  one  case  amounting  to  25  degrees.  We  therefore 
applied  to  the  coordinates  received  a  rotation  designed  to  give  the  errors 
in  range  and  deflection  with  respect  to  the  actual  heading  of  the  airplane. 
Table  3.1  gives  the  principal  statistics  summarizing  both  the  original 
data  on  range  and  deflection  and  the  values  as  adjusted  by  these  rotations. 
Except  for  the  first  column  of  this  one  table,  all  numerical  results  given 
hereinafter  will  refer  to  the  rotated  values.  In  each  case  r  denotes  the 
error  in  range  and  d  the  error  in  deflection.  Positive  values  of  r  refer  to 
bombs  overshooting  the  target,  negative  values  of  r  to  bombs  that  fall 
short.  Deflection  errors  to  the  right  are  considered  positive;  those  to 
the  left  are  negative.  The  unit  of  measure  is  in  each  case  the  mil, 
about  1/1,000  of  a  radian,  or  about  2/35  of  a  degree.  The  data  as 
received  refer  to  actual  impacts,  uncorrected  for  differential  ballistic 
wind,  nonstandard  altitude,  nonstandard  air  speed,  or  range  component 
of  crosstrail.  The  actual  altitudes  are  understood  to  have  fluctuated 
rather  widely  about  7,000  ft.,  but  since  the  errors  are  measured  in  mils 
the  effects  of  these  altitude  variations  on  the  results  of  the  analysis 
should  not  be  great. 

If  any  one  rotation  had  been  applied  to  all  the  points  of  impact, 
the  squared  distance  f2  +  d2  of  the  mean  point  of  impact  from  the  target 
would  be  invariant.     The  fact  that  f2  +  d2  has  two  different  values,  8.83 
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for  the  original  and  9.67  for  the  rotated  data,  results  from  the  differences 
of  the  rotations  applied  for  the  different  bombs. 

TABLE  3.1 
Summary  of  Bombing  Errors  in  65  Flights  (260  Bombs) 


Mean  range  error*  (over),  f 

Mean  deflection  error*  (right),  d 

Standard  deviation  of  range  errors,  sr 

Standard  deviation  of  deflection  errors,  Sd. 
Correlation  of  range  with  deflection  errors . 


Original 
(mils) 


0.5108 
2.9292 
6.8596 
8.7195 
•0.037 


Rotated 
(mils) 


1.3450 
2.8046 
6 . 9074 
8.6191 
0.035 


*  These  are  means  of  positive  and  negative  errors,  not  of  their  absolute  values.  The  first  two  lines  in 
Table  3.1  would  give  the  coordinates  of  the  mean  point  of  impact  (MPI)  of  all  the  260  bombs  if  the  same 
course  had  been  flown  in  each  case.  They  reflect  biased  error,  whereas  the  next  two  lines  give  measures 
of  the  unbiased  or  compensating  errors,  often  measured  less  accurately  by  the  "mean  deviation,"  or 
mean  of  absolute  values  of  errors,  which,  to  be  comparable  with  the  standard  deviation,  would  need  to 
be  multiplied  by  a  factor  that  for  large  samples  approximates  -\/ir/2  =  1.253. 

Both  the  mean  range  error  and  the  mean  deflection  error  (that  is, 
the  means  of  the  actual  values  of  the  errors,  positive  and  negative, 
not  of  the  absolute  values)  calculated  from  the  rotated  data  are  statis- 
tically significant  relative  to  their  standard  errors.  Thus  it  is  clear  that 
there  were  biased  errors  throwing  the  mean  point  of  impact  (MPI)  beyond 
and  to  the  right  of  the  target.  The  bias  in  deflection  could  have  been 
detected  from  the  original,  unrotated  observations,  but  in  them  the  bias 
in  range,  being  smaller,  was  obscured  by  the  addition  of  the  variance 
among  the  four  headings  on  the  same  flight  to  the  variance  due  to  other 
causes. 

As  in  various  other  bombing  records,  the  standard  deviation  in 
deflection  is  greater  than  that  in  range.  The  excess  is  24.8  percent  in 
the  present  sample  and  is  statistically  significant.  The  appropriate 
test  is  in  terms  of  F  =  (sd/sr)2,  which  equals  1.6158  for  the  original 
and  1.5570  for  the  rotated  data.  With  the  259  degrees  of  freedom  in  each 
of  the  variance  estimates,  the  probability  that  F  should  exceed  1.470  is 
0.001.  When,  as  here,  the  greater  variance  is  divided  by  the  smaller, 
the  probability  is  less  than  1  in  500  of  obtaining  so  great  a  ratio  of  stand- 
ard deviations  as  that  actually  observed  if  the  true  standard  deviations 
are  equal.  At  the  customary  0.05  probability  level,  confidence  limits 
for  the  percentage  by  which  the  standard  deviation  in  deflection  exceeds 
that  in  range  are  12.6  percent  and  38.2  percent. 

The  correlations  —0.037  and  —0.035  are  completely  insignificant, 
since  their  standard  error  is  1/V259  =  0.0621.  This  of  course  does  not 
definitely  prove  that  range  and  deflection  errors  are  mutually  inde- 
pendent; it  only  fails  to  disprove  it. 
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Among  the  65  flights  there  were  variations  with  respect  to  both 
personnel  and  equipment.  Ten  different  teams,  each  consisting  of  a  pilot 
and  a  bombardier,  were  represented.  The  number  of  flights  made  by  a 
team  varied  from  1  to  17  in  the  several  cases.  Automatic  Erection 
System  (AES)  was  used  in  24  cases  and  Stabilized  Bombing  Approach 
Equipment  (SBAE)  in  8  cases.  The  distribution  of  the  flights  in  relation 
to  personnel  and  equipment  is  shown  in  Table  3.2.  The  first  columns 
designate  the  pilot  and  the  bombardier  by  their  initials  and  indicate  the 
combination  of  the  two  by  a  number. 

TABLE  3.2 
Distribution  of  Flights  by  Personnel  and  Equipment 


Bombard- 
ier 

Team 
number 

Number  of  cases 

Pilot 

Neither 

AES  nor 

SBAE 

AES 
only 

SBAE 
only 

Both 

AES  and 

SBAE 

Total 
for  team 

WT 

W 

R 

W 

R 

K 

K 

K 

W 

K 

H 
S 
R 
H 
H 
U 
H 
L 
R 
R 

1 
2 
3 

4 
5 
6 
7 
8 
9 
10 

10 

6 
4 
1 
7 
1 
4 
2 
1 
1 

5 
3 

4 

4 
2 
2 

2 
1 

1 

3 

1 

17 
10 
7 
3 
11 
1 
8 
4 
3 
1 

Totals . . 

37 

20 

4 

4 

65 

5.2.  Analysis  of  the  Data 

The  primary  object  of  the  analysis  was  to  obtain  suitable  coefficients 
a,  b,  and  c  for  use  in  the  expressions  for  T2  that  are  to  measure  the  accu- 
racy of  individual  bombsights.  However,  the  variances  entering  into 
these  expressions  are  presumably  affected  by  the  variations  of  equipment 
and  of  personnel  among  the  flights  examined.  It  is  to  be  expected  that 
this  heterogeneity  will  increase  the  sample  variances  above  the  values 
they  would  have  if  equipment  and  personnel  had  been  kept  constant  on 
all  the  flights.  Such  an  increase  of  variance  would  reduce  the  values  of 
a,  b,  and  c,  and  therefore  the  values  of  T2  computed  from  them  in  future 
tests,  giving  an  illusory  impression  of  accuracy  in  those  tests  in  which 
equipment  and  personnel  are  close  to  the  average  situation  in  the  cases 
analyzed.  To  determine  whether  this  is  a  serious  bias  and  to  correct 
for  it  if  it  is,  it  seemed  desirable  to  consider  regression  equations  for 
predicting  the  range  and  deflection  errors  in  terms  of  equipment  and 
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personnel.  At  the  same  time,  the  opportunity  was  at  hand  to  obtain 
tentative  estimates  characterizing  the  performance  of  particular  teams 
and  of  the  particular  types  of  equipment  and  to  test  the  accuracy  of 
these  estimates. 

Several  different  regression  equations  were  used.  In  the  first  instance, 
to  make  a  rather  extensive  elimination  of  extraneous  factors,  we  con- 
sidered regression  formulas  with  no  less  than  79  terms.  Besides  an 
additive  constant,  these  terms  were 

Xi,  defined  as  1  if  AES  Avas  used,  and  otherwise  0 

X2,  defined  as  1  if  SBAE  Avas  used,  and  otherwise  0 

xz,  defined  as  1  if  the  airplane  Avas  flying  north,  but  0  if  it  was  flying 

south 
x^,  (i  =  1,  2,   •  •  •  ,  10),  equal  to  1  if  the  bomb  Avas  dropped  by  the 

^th  team,   and  otherwise  0.     The  number  of  variables  in  this 

group  was  of  course  10,  the  number  of  teams. 
x"j,  (i  —  1,  2,   •   •   •  ,10),  equal  to  1  if  the  bomb  was  dropped  on  the 

jih  of  the  flights  of  the  ^th  team,  and  otherwise  0.     The  number 

of  variables  in  this  group  was  65,  the  same  as  the  number  of  flights. 

Two  different  predictands,  the  range  and  deflection  errors,  were  used  with 
this  same  set  of  predictors.  The  regression  equation  for  estimating 
range  error  in  terms  of  all  the  predictors  was  of  the  form 

10 

R  =  b0  +  biX!  +  b2x2  +  63x3  +  £  bW  +  £  &'/<■ 

i  =  l 

with  a  similar  formula  for  estimated  deflection  error  D.  The  coefficients 
were  determined  by  the  method  of  least  squares. 

It  might  seem  that  the  solution  of  a  least-square  problem  Avith  79 
unknoAvns  would  be  an  utterly  fantastic  task,  to  say  nothing  of  two  such 
solutions.  Yet  through  special  circumstances  it  was  possible  not  only  to 
carry  through  this  calculation  completely,  with  the  relevant  standard 
errors  and  tests  for  significance  in  terms  of  probability,  but  to  do  so 
with  a  very  modest  amount  of  computing.  One  of  these  circumstances 
was  that,  because  of  the  nature  of  the  variables,  a  large  majority  of  the 
elements  of  the  matrix  of  the  normal  equations  were  zero.  Also,  a 
simplification  Avas  obtained  through  choice  of  13  linear  relations  that 
can  be  imposed  somewhat  arbitrarily  upon  the  regression  coefficients 
because  of  13  identical  linear  relations  among  the  predictors,  Avhich 
reduce  the  rank  of  the  79-rowed  matrix  to  66.  These  13  identical  rela- 
tions express  the  following  facts: 

The  sum  of  the  values  of  x%  for  all  the  flights  of  the  ith  team  equals 
x\.     There  are  10  of  these  relations,  corresponding  to  the  10  teams, 
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The  sum  x[  +  xf2  +   •  •  •   +  x'10  —  1 

The  sum  of  the  values  of  x^  for  all  flights  using  AES  is  x\ 

The  sum  of  the  values  of  seJJ  f°r  an  flights  using  SBAE  is  :r2 

A  linear  constraint  corresponding  to  each  of  these  was  placed  upon 
the  regression  coefficients  in  each  group  in  order  to  make  them  deter- 
minate. Though  it  is  customary  in  problems  of  this  kind  to  use  the 
simplest  constraints  by  merely  equating  to  zero  the  sum  of  the  coefficients 
in  each  group,  we  chose  a  different  set  of  constraints  adapted  to  simplify- 
ing the  numerical  work.     For  example,  where  a  constraint  of  the  form 

b[  +  K  +   •  v  •   +  b'1Q  =  0 
would  ordinarily  be  imposed,  we  used  instead 

176;  +  106J  +  7b'3  +   •  •  •   +  b[0  =  0 

with  coefficients  equal  to  the  numbers  of  flights  of  the  respective  teams. 
This  made  possible  a  great  reduction  in  labor  because  of  the  occurrence  of 
this  particular  linear  combination  in  the  normal  equations.  All  results  of 
importance  involving  the  b"s  will  necessarily  be  in  terms  of  their  differ- 
ences and  do  not  seem  to  be  affected  by  replacing  one  such  constraint  by 
another. 

The  solution  of  the  normal  and  constraining  equations  was  carried 
out  literally,  in  terms  of  symbols  for  the  right-hand  members  of  the  normal 
equations,  instead  of  numerically.  In  this  way  it  was  easy  to  obtain 
the  two  regression  equations,  one  for  range  and  the  other  for  deflection 
errors,  by  substituting  the  two  sets  of  right-hand  members.  Further- 
more, the  literal  solution  made  easy  the  determination  of  standard 
errors  and  correlations  among  the  regression  coefficients  in  either  equation. 
Thus  it  played  somewhat  the  same  role  as  that  of  the  inverse  matrix  in 
least-square  problems  involving  no  identical  relations  or  constraints. 

The  question  whether  there  was  significant  variation  among  the  10 
teams  with  regard  to  bias  in  range  and  deflection  errors  was  tested  by 
means  of  the  variance  ratio  distribution.  The  denominator  of  this  ratio 
was  the  estimate  of  variance  based  on  the  sum  of  squares  of  residuals 
from  the  regression  equation.  Since  there  were  260  observations  and  79 
regression  coefficients  connected  by  13  constraints,  the  number  of  degrees 
of  freedom  for  this  residual  variance  was  260  —  (79  —  13)  =  194.  The 
numerator  was  a  quadratic  form  in  the  b"s  with  9  degrees  of  freedom, 
since  they  are  subject  to  the  one  relation  indicated  above.  The  value 
of  such  a  quadratic  form  can  be  found  by  first  ignoring  any  one  of  the  b"s 
(which  we  arbitrarily  chose  as  b[),  inverting*  the  covariance  matrix  of 

*  Several  methods  of  numerical  inversion  of  matrices  are  described  in  Dwyer, 
"Solution  of  Simultaneous  Equations"  and  Hotelling,  "Matrix  Calculation."  The 
method  given  by  Dwyer  is  particularly  valuable. 
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the  other  nine  corresponding  to  unit  variance  of  residuals,  and  then 
calculating  straightforwardly  the  quadratic  form  of  which  this  inverse 
is  the  matrix  and  in  which  the  variables  are  the  nine  6"s  used.  This 
has  only  to  be  divided  by  9  to  give  the  numerator  of  the  variance  ratio, 
essentially  an  estimate  of  variance  among  the  6"s,  but  differing  from  a 
mere  sum  of  squares  of  their  deviations  from  their  mean  because  of  their 
unequal  weights  and  correlations  resulting  from  the  unbalanced  dis- 
tribution of  equipment  and  personnel.  The  inverse  matrix  is  the  same 
for  range  as  for  deflection  errors,  since  it  depends  only  on  the  distribution 
of  the  predictors  and  not  on  the  predictands. 

For  range  errors  the  variance  ratio  F  found  in  this  way  was  1.423; 
for  deflection  errors  it  was  1.426.  These  are  considerably  less  than  the 
value  1.923  of  F  that  would  just  be  regarded  as  significant  on  the  cus- 
tomar}^  0.05  level  and  indeed  are  little  greater  than  the  value  that  would 
be  exceeded  by  chance  with  probability  0.20  if  the  true  mean  range  and 
deflection  deviations  were  the  same  for  all  teams.  Thus  no  evidence 
pointing  to  any  variation  among  teams  in  these  biased  errors  emerges 
from  this  test. 

With  two  variates  such  as  range  and  deflection  errors  the  two  separate 
tests  may,  with  a  substantial  increase  in  accuracy,  be  replaced  by  a 
single  test.  In  place  of  the  foregoing  variance  ratios  we  may  well  con- 
sider a  combined  test  due  to  Wilks.*  Each  variance  is  replaced  in 
this  test  by  a  symmetric  two-rowed  determinant.  The  elements  of  the 
denominator  determinant  are  the  variances  and  the  covariance  of  range 
and  deflection  as  estimated  from  the  residuals  from  the  two  regression 
equations  in  which  they  are  the  predictands.  The  numerator  deter- 
minant's elements  are  the  two  quadratic  forms  computed  from  the  two 
sets  of  b"s,  and  the  bilinear  form  with  the  same  matrix  and  involving 
both  sets  of  b"s.  The  value  1.G96  obtained  for  this  generalized  variance 
ratio  would  be  exceeded  by  chance  with  a  probability  considerably  greater 
than  0.05,  though  much  further  below  0.20  than  in  the  separate  variance 
ratios  for  range  and  deflection.  Thus  even  this  more  sensitive  test 
fails  to  give  real  evidence  of  variation  in  mean  point  of  impact  among 
the  men  engaged  in  dropping  the  bombs.  That  there  is  a  real  bias,  at 
least  in  deflection,  is  easily  inferred  from  Table  3.1  or  from  other  bombing 
records.  Apparently  the  bias  is  much  the  same  for  all  the  10  teams  on 
whose  performance  these  records  are  based. 

Application  of  the  corresponding  tests  for  the  effect  on  the  MPI 
of  AES,  of  SBAE,  and  of  direction  of  flight  was  much  simpler  because  each 
of  these  effects  corresponds  to  only  1  degree  of  freedom,  so  that  the  test 
for  either  range  or  deflection  reduces  merely  to  the  comparison  of  a  regres- 
sion coefficient  with  its  standard  error.     When  AES  was  used,  the  MPI 

*  Wilks,  "Generalizations  in  the  Analysis  of  Variance." 
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was  1.6536  mils  farther  in  the  "short"  direction  than  as  if  it  had  not 
been  used,  according  to  the  regression  coefficient,  which  makes  allow- 
ances for  effects  of  SBAE,  direction  of  flight,  and  personnel.  This 
coefficient  is  statistically  significant  on  the  customary  0.05  probability 
level.  Since  the  MPI  for  all  the  bombs  was  1.3450  mils  "over,"  the  use 
of  AES  appears  to  have  been  advantageous  in  this  respect.  The  cor- 
responding coefficient  for  deflection  was  not  significant,  and  the  combined 
test  by  Wilks's  ratio  of  determinants  also  failed  to  yield  a  significant  result 
on  the  0.05  level.  The  same  tests  applied  to  SBAE  gave  results  far  from 
significant,  suggesting  no  difference  according  as  this  equipment  was 
used  or  not. 

The  effect  of  direction  of  flight  on  MPI  was  tested  by  means  of  the 
coefficient  63  in  the  79-term  regression  equation.  For  range  the  effect 
was  insignificant.  But  for  deflection  the  value  —4.1045  was  significant 
even  on  the  0.01  level.  This  indicates  that  the  bias  toward  the  right  was 
considerably  less  flying  north  than  flying  south. 

The  variations  of  MPI  among  flights  were  tested  by  an  approximate 
method,  simpler  than  that  used  for  personnel  variance,  which  would  for 
65  flights  with  53  degrees  of  freedom  have  involved  the  considerable  labor 
of  inverting  a  matrix  of  53  rows.  Such  a  piece  of  calculation  is  not  so 
completely  out  of  the  question  as  would  once  have  been  thought,  but  it 
did  not  seem  worth  while  for  the  present  purpose.  By  ignoring  the 
unbalance  occasioned  by  disproportionate  frequencies  of  equipment  and 
personnel,  while  utilizing  the  balance  of  north  and  south  flights  and  the 
fact  that  exactly  four  bombs  were  dropped  on  each  flight,  these  considera- 
tions lead  to  the  estimate  426"2/53  of  variance  among  flights.  This 
turns  out  to  be  insignificant  for  deflection  but  significant  for  range. 
The  combined  test  for  range  and  deflection  failed  to  yield  a  significant 
result.  Thus  any  important  effect  on  MPI  of  variations  among  flights 
seems  to  be  confined  to  the  range  components.  Such  variations  refer 
to  effects  of  varying  weather  and  other  factors  not  taken  into  account 
in  the  regression  equation  but  do  not  include  the  variations  of  personnel 
and  equipment  considered. 

Since  it  is  to  be  expected  that  pilots  and  bombardiers  will  change 
from  time  to  time,  as  will  also  the  weather,  the  variances  of  deviations 
from  the  79-term  regression  functions  will  be  less  relevant  than  if  the 
effects  of  the  fluctuations  had  not  been  eliminated.  If  the  personnel 
were  to  remain  constant,  and  of  constant  ability,  and  if  an  adequate 
measure  of  seasonal  variation  were  available  from  records  over  a  period  of 
years  based  on  ore-filled  bombs,  the  appropriate  estimates  of  variances 
and  correlation  for  appraising  bombsights  under  test  would  be  in  terms  of 
deviations  from  a  regression  equation  involving  all  these  factors.  The 
omission  of  the  terms  in  the  regression  function  corresponding  to  per- 
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sonnel  may  also  be  justified  on  the  ground  that,  as  seen  above,  no  dis- 
cernible differences  between  the  MPI's  of  different  teams  have  been 
found. 

Inclusion  of  an  allowance  for  equipment  type  does  seem  to  be  needed, 
first  because  of  the  highly  significant  difference  in  MPI  associated  with 
AES,  and  second  because  of  the  prospect  that  future  standardization 
will  lead  either  to  the  use  or  the  nonuse  of  each  of  the  types  of  equipment 
with  all  sights  tested  alike.  Furthermore,  it  is  to  be  hoped  that  methods 
will  be  found  to  eliminate  the  difference  between  the  MPI  of  bombs 
dropped  on  northward  and  on  southward  flights.  The  values  recom- 
mended for  the  coefficients  a,  b,  and  c  of  the  test  functions  T2  to  be  used 
for  future  control  charts  have  therefore  been  based  on  the  residuals 
from  regression  functions  with  only  four  terms.  Besides  a  constant, 
these  terms  involved 

Xi,  which  equals  1  if  AES  was  used  and  is  otherwise  0 

X2,  which  equals  1  if  SBAE  was  used  and  is  otherwise  0 

xs,  Avhich  equals  1  if  heading  was  north,  but  0  if  heading  was  south 

These  regression  equations  were  found  to  be,  for  range  errors  in  mils  over, 
R  =  2.9125  -  1.4457*1  -  2.8149*2  -  1.3746*3 

and  for  deflection,  measured  in  mils  right, 

D  =  5.4099  -  0.8106*i  -  1.0176*2  -  4.3615*3 

The  deviations  from  either  of  these  regression  equations  are  260 
in  number  but  have  only  256  degrees  of  freedom  because  of  the  four 
constants  in  the  equation,  in  accordance  with  well-known  least-square 
theory.  The  sum  of  squares  of  these  deviations  from  one  of  the  equa- 
tions, divided  by  256,  gives  the  estimated  variance  in  range  or  deflection. 
In  the  same  way,  the  sum  of  products  of  deviations  from  the  two  equa- 
tions, divided  by  256,  gives  the  estimated  covariance.     These  values  are 

Range  variance s2  =  Su  =  46.294995 

Deflection  variance s2  =  s22  =  70.033392 

Covariance srd  =  Si2  =  —4.290599 

From  these  values  a  simple  calculation  gives 

sr  =  6.8040  mils         sd  =  8.3686  mils         r  =  -.07535 

the  estimated  values  mentioned  in  previous  parts  of  this  chapter  of  the 
true  standard  deviations  ar  and  ad  and  correlation  p.  The  number  256 
of  degrees  of  freedom  is  large  enough  to  make  the  estimates  sr  and  sd 
quite  reliable.  The  standard  error  of  each  of  these  standard  deviations 
is  only  about  4.42  percent  of  its  value.     That  of  r  is  exactly  ^q,  making 
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the  correlation  only  1.206  times  its  standard  error.  The  probability  of 
so  great  a  value  of  \r\  in  the  absence  of  any  real  correlation  is  about  0.23, 
so  that  the  hypothesis  of  independence  between  range  and  deflection  is 
not  refuted  by  these  data,  though  there  is  a  faint  suggestion  that  bombs 
that  fall  short  tend  to  fall  to  the  right,  while  those  going  over  the  target 
tend  to  fall  to  the  left. 

The  inverse  of  the  matrix 


is 


and  from  the  above  values  we  find 

a  =  0.021724         b  =  0.001331         c  =  0.014360 
the  values  used  in  the  plan  presented  in  Sec.  2. 

5.3.  Ratings  of  Equipment  and  Personnel  by  Test  Bombing 

A  series  of  test  bombings  can  be  utilized  to  obtain  information  not 
only  regarding  the  sights  used,  but  at  the  same  time  regarding  other 
equipment  and  the  men  who  do  the  flying  and  drop  the  bombs.  For  each 
flight  Tl  supplies  a  measure  of  the  general  success,  and  this  is  easily 
analyzed  into  the  components  T2M  and  T2D  corresponding,  respectively,  to 
deviations  of  the  MPI  from  the  target  and  of  the  bombs  from  the  MPI. 
By  combining  the  values  of  Tl  (or  of  T2M  or  of  T2D)  for  all  the  flights  on 
which  a  particular  kind  of  equipment  was  used,  or  on  which  a  particular 
man  or  team  flew,  and  contrasting  with  the  flights  not  involving  the 
same  men  and  equipment,  it  is  possible  to  obtain  ratings  of  personnel 
and  equipment.  Approximate  probability  distributions  are  available 
for  testing  the  reliability  of  such  ratings,  since  each  sum  of  independent 
values  of  T2  may  with  little  error  be  regarded  as  having  the  x2  distribu- 
tion, and  the  ratio  of  two  such  sums  the  F  distribution.  Since  several 
kinds  of  equipment  in  addition  to  the  personnel  can  thus  be  rated  on 
the  basis  of  the  same  bombing  records,  such  multipurpose  test  bombing  is 
relatively  very  economical  and  productive  of  information. 

Complete  validity  of  such  ratings  is,  however,  essentially  dependent 
on  a  condition  that  must  be  provided  before  the  bombings,  not  afterward. 
This  condition  is  that,  subject  to  whatever  general  restrictions  are  appro- 
priate, the  assignment  of  personnel  and  equipment  to  particular  flights 
must  be  strictly  at  random.  There  is  no  way  of  doing  this  except 
by  the  use  of  the  actual  mechanism  of  games  of  chance,  or  of  their 
equivalent  in  the  form  of  tables  of  random  numbers.     In  the  absence  of 
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such  objective  chance,  the  assignments  must  be  made  by  human  beings. 
Experience  has  clearly  shown  that  human  beings  unaided  by  gambling 
equipment  are  unable  to  assign  different  test  materials  to  different 
experimental  trials  in  such  a  way  as  to  avoid  bias,  either  in  the  averages 
to  which  the  experiment  leads  or  in  the  measures  of  error  that  must  be 
computed  as  functions  of  deviations  from  the  observed  averages.  Conse- 
quently, the  ratings  of  the  pilots  and  bombardiers  in  Table  3.4,  while 
historically  accurate,  are  subject  to  the  possibility  of  being  explained  away 
by  features  of  the  method  by  which  assignments  were  made.  This  same 
difficulty  qualifies  the  statistical  interpretation  of  economic  time  series 
and  other  such  records  which,  though  frequently  the  best  available 
guides,  lack  the  kind  of  validity  possible  only  in  a  well-designed 
experiment. 

To  secure  efficiency  as  well  as  validity  in  an  experiment,  it  is  usually 
necessary  to  impose  restrictions  on  the  randomization.  Thus  in  com- 
paring the  results  of  two  treatments  when  the  chance  variances  are  the 
same  in  both  cases,  the  standard  error  of  the  difference  of  the  means  is 
proportional  to  y/l/N  +  1/N',  where  N  and  N'  are  the  numbers  to 
which  the  two  treatments  are  applied.  If  N  +  N'  has  any  fixed  value, 
the  standard  error  is  therefore  a  minimum  when  N  =  N'.  Randomiza- 
tion in  assignment  of  the  cases  to  the  two  treatments  should  then  be 
subject  to  the  condition  N  =  Nf.  This  could  be  done  by  shuffling  and 
randomly  dividing  a  pack  of  cards  consisting  of  black  and  red  cards 
in  equal  numbers.  It  could  not  be  done  merely  by  flipping  a  coin 
once  for  each  case,  since  the  condition  N  =  Nf  might  not  then  be  satisfied. 
This  question  of  efficient  arrangement  of  experiments  through  restricted 
randomization  has  many  ramifications,  particularly  where  several 
questions,  such  as  the  efficacy  of  several  kinds  of  equipment  serving  dif- 
ferent purposes,  are  to  be  tested  simultaneously  in  the  same  experiment.* 

Values  of  T%,  T2M,  and  T\  were  computed  for  each  of  the  65  flights. 
Graphs  of  these  three  functions  in  the  order  in  which  the  flights  were 
made  are  appended  as  Figs.  3.8,  3.9,  and  3.10,  respectively.  These 
figures  have  the  general  appearance  of  control  charts.  They  differ, 
however,  from  suitable  control  charts  in  certain  respects.  Measures  of 
quality  should  be  plotted  on  a  control  chart  in  the  order  in  which  the 
articles  are  manufactured,  whereas  here  the  order  is  that  in  which  the 
tests  were  performed.  In  this  material  several  different  series  of  serial 
numbers  were  represented,  referring  to  different  modifications  of  design 
and  perhaps  different  manufacturers.  It  is  best  to  run  a  separate  control 
chart  for  each  modification  and  for  each  factory. 

The  points  in.  these  graphs  are  on  the  average  higher  relative  to  the 
theoretical  probability  0.20,  0.05  and  0.01  levels  shown  than  would  be 

*  Fisher,  Design  of  Experiments,  is  the  best  treatise  on  this  subject. 


178 


TECHNIQUES  OF  STATISTICAL  ANALYSIS 


Chap.  3 


Scale  of  Probability 


2°l  JO  8|O0S 


Sec.  5.3 


MULTIVARIATE  QUALITY  CONTROL 


179 


Scale  of  Probability 


- 

< 

\  : 

-*=: 

— — 

\ 

^  : 

< 

i 

r 

- 

- 

- 

^~ 

+ 

—" 

— — 

— — ■* 

V - 

- 

<^- 

1  1  1  1 

i  i  i 

1 

1 

1  1 

n 

c 

3 

IX 

■) 

o 

%   8 


3 


8"l  JO   3|03S 


180 


TECHNIQUES  OF  STATISTICAL  ANALYSIS 


Chap.  3 


Scale  of  Probability 


q 

q 

CSJ 

<~ 

- 

^ 

- 

J^ 

1     " 

^^    - 

\      - 

"^>      - 

J 

- 

>V          - 

MM 

1     1 

1    1 

1 

1     1 

_l 

1     1     1 

1     1     1     1 

© 

P< 

a 

od 

E  fe 

«Q 

J3 

E 

b. 

3 

2 

, 

3 

OS 

OU. 

a 

rt 

o 

f-H 

Hi 

a 

m 

01 

P 
1 

1 

d 

w 

CO 

o 

<M 

o 

°l   JO   9|  DOS 


Sec.  5.3 


MULTIVARIATE  QUALITY  CONTROL 


181 


expected  in  a  manufacturing  and  testing  routine  in  a  state  of  control. 
The  reason  for  this  lies  in  a  particular  feature  of  the  way  the  charts  were 
constructed  rather  than  in  any  shortcomings  of  the  sights  tested.  The 
variances  on  which  the  coefficients  in  T2  are  based  were  chosen  in  Sec. 
5.2  on  the  basis  of  deviations  from  4-term  regression  functions,  in  order  to 
eliminate  from  these  estimates  of  variance  the  effects,  which  it  is  to  be 
anticipated  are  not  to  continue  long,  of  the  differences  between  use  and 
nonuse  of  the  Automatic  Erection  System  and  Stabilized  Bombing 
Approach  Equipment,  and  between  northward  and  southward  flights. 
This  elimination  has  the  effect  of  decreasing  the  variances  somewhat  and 
thus  of  increasing  the  coefficients  in  each  of  the  expressions  for  T2. 
If  a,  b,  and  c  had  been  calculated  by  inverting  the  two-rowed  matrix  of 
sums  of  products  of  the  raw  deviations  from  the  target,  and  if  these  raw 
deviations  had  then  been  substituted  in  T%,  T2m,  or  T2Dj  the  average  of 
the  resulting  values  would  for  each  of  these  three  functions  be  exactly 
the  number  of  degrees  of  freedom,  respectively,  8,  2,  and  6.  If,  on  the 
other  hand,  with  the  values  of  a,  b,  and  c  actually  used,  r  and  d  had  been 
replaced  by  the  deviations  from  the  4-term  regression  functions,  the 
averages  would  again  in  each  case  be  exactly  equal  to  the  number  of 
degrees  of  freedom.  But  because  actual  deviations  were  substituted  for 
r  and  d  while  corrected  variances  were  used  in  determining  the  coeffi- 
cients, the  averages  exceed  the  numbers  of  degrees  of  freedom.  Thus  the 
average  of  the  65  values  of  T20  is  9.00786  instead  of  8.  This  is  reflected 
graphically  in  the  fact  that  the  proportions  of  the  65  cases  above  the 
horizontal  lines  are  in  general  somewhat  greater  than  the  respective 
probabilities. 

From  Table  3.3  it  appears  that  AES  and  SBAE,  instead  of  improving 
bombing  accuracy,  seemed  to  make  it  worse.  The  significance  of  this 
apparent  effect  may  be  tested  (with  an  approximation  equivalent  to 
identifying  the  sample  variances  and  covariance  based  on  256  degrees 

TABLE  3.3 

Effect  of  Equipment  on  Over-all  Accuracy  of  Bombing 

(Excellent  bombing  is  indicated  by  low  values  of  Tq) 


Flights  using  AES 

Flights  not  using  AES . . 
Flights  using  SBAE .... 
Flights  not  using  SBAE 


Number 
of  flights 


24 
41 

8 
57 


*n 


228.350001 

357.161062 

79.554375 

505.956688 


Average  T20 


9.51458 
8.71125 
9.94430 

8.87643 


of  freedom  with  the  true  values  of  these  parameters  in  an  infinite  popula- 
tion) by  means  of  the  variance  ratio  distribution  with  numbers  of  degrees 
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of  freedom  equal  to  eight  times  (since  each  flight  has  8  degrees  of  freedom) 
the  numbers  of  flights  with  and  without  the  equipment  in  question. 
For  AES  the  ratio  is 

„       9.51458 


8.71125 


1.09222 


with  rii  =  192  and  n2  =  328.  This  is  not  at  ail  significant  since  the 
probability  of  a  greater  value  of  F  is  considerably  more  than  0.20. 
Actually,  the  relevant  probability  is  more  than  double  this  value  because 
of  our  selection  of  the  greater  mean  value  of  T%  to  put  in  the  numerator. 
For  SBAE  we  have 

„       9.94430 


8.87643 


1.12030 


which,  with  ni  =  64  and  n2  =  456,  is  similarly  insignificant.  Thus  these 
results  do  not  prove  that  AES  and  SBAE  have  deleterious  effects.  They 
would  not  actually  prove  ifc  in  a  really  satisfactory  manner  even  if  the 
results  had  turned  out  to  be  significant,  since  the  assignment  of  the 
equipment  to  particular  flights  was  presumably  not  on  the  basis  of  objec- 
tive randomization.  It  is  entirely  possible  that  the  decision  to  use  or 
not  to  use  either  of  these  kinds  of  equipment  on  each  occasion  had  some 
relation,  conscious  or  unconscious,  to  the  weather,  the  altitude,  the 
personnel,  or  some  other  variable  affecting  the  accuracy. 

The  same  qualifications  apply  to  the  average  values  of  T%  for  the 
10  teams  shown  in  Table  3.4.  Such  scores  are  reliable  indicators  of 
differences  in  personal  ability  only  if  one  of  two  conditions  is  satisfied 

TABLE  3.4 

Variation  of  Over-all  Accuracy  with  Personnel 

(Excellent  bombing  is  indicated  by  low  values  of  Tq) 


Teams 

Number  of  nights 

*n 

Average  TQ 

1-WT-H 

17 

184 . 834959 

10.87264 

2— W-S 

10 

73.105782 

7.31058 

3— R-R 

7 

62.168030 

8.88115 

4— W-H 

3 

30.389824 

10.12994 

5— R-H 

11 

72 . 692086 

6.60837 

6— K-J 

1 

3.240875 

3.24088 

7— K-H 

8 

98 . 558949 

12.31987 

8— K-L 

4 

22.423547 

5 . 60589 

9— W-R 

3 

28 . 637496 

9 . 54583 

10— K-R 

1 

9.459515 

9.45952 

All  nights 

65 

585.511063 

9.00786 

Either  the  assignment  of  the  men  must  be  suitably  randomized,  or  an 
exhaustive  knowledge  of  the  other  causes  of  variation  must  be  at  hand 
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and  must  show  that  these  other  causes  exactly  balanced  as  between 
the  different  teams.  Since  it  is  practically  impossible  to  fulfill  the  second 
condition,  objective  randomization  in  advance  is  the  only  practical  way 
of  validating  scores. 

Even  under  randomization,  differences  in  scores  prove  nothing  unless 
they  are  too  great  to  be  ascribed  to  chance.  As  between  any  two  teams 
selected  without  reference  to  their  scores  in  this  bombing  series,  the 
effects  of  chance  may  be  examined  by  means  of  the  variance  ratio,  equal 
to  the  ratio  of  the  mean  values  of  T%,  with  numbers  of  degrees  of  freedom 
equal  to  the  numbers  of  flights  of  the  two  teams  multiplied  by  the  number 
of  degrees  of  freedom,  here  8,  for  each  flight.  However,  it  would  be  an 
error  to  compare  in  this  way  two  teams  selected  because  their  scores  are 
the  greatest  and  the  least  or  because  they  occupy  adjacent  places  when 
arranged  in  the  order  of  the  scores.  Whether  the  10  scores  as  a 
whole  differ  significantly  from  one  another  might  be  determined  by  the 
application  of  such  a  criterion  as  Bartlett's.*  The  tests  might  also  be 
refined,  if  it  should  seem  worth  while,  by  a  more  complete  segregation 
of  the  effects  of  personnel  from  those  of  each  kind  of  equipment,  here 
represented  by  overlapping  degrees  of  freedom,  and  of  interactions. 
We  are  dealing  here  with  generalizations  of  the  ordinary  analysis  of  vari- 
ance, generalizations  whose  fundamental  theory  has  been  developed 
only  part  of  the  way  necessary  for  an  efficient  analysis  of  the  ramifications 
of  bombing  accuracy. 
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CHAPTER  4 
EFFECTS  OF  ROUNDING  OR  GROUPING  DATA 

1.  INTRODUCTION 

A  statistician  must  of  necessity  work  with  rounded  numbers,  at 
least  to  the  extent  that  his  data  are  recorded  values  of  continuous 
variables,  that  is,  measurements.  Enumeration  data  may  be  rounded 
also,  to  tens,  hundreds,  thousands,  or  millions,  for  convenience  in  han- 
dling. The  more  coarsely  the  data  are  rounded,  the  less  accurately  do 
they  represent  the  facts  observed,  and  very  coarse  rounding  may  render 
them  useless  as  a  basis  for  conclusions  about  the  phenomena  concerned. 
This  is  self-evident;  but  it  is  not  always  appreciated  that  rounding 
considerably  less  coarse  than  that  which  renders  the  data  useless  may, 
nevertheless,  greatly  impair  or  even  invalidate  the  analysis  of  the  data. 

William  Sealy  Gosset,  better  known  as  " Student,"  has  given  a  most 
penetrating  discussion  of  the  effects  of  coarse  rounding  or  grouping  of 
data  on  their  statistical  properties.*  Referring  to  certain  data,  relating 
to  height  and  left  middle  finger  measurements  of  3,000  criminals,  which 
he  had  used  for  an  experimental  test  of  his  theoretical  results,  he  wrote: 

Another  thing  which  interferes  with  the  comparison  is  the  comparatively 
large  groups  in  which  the  observations  occur.  The  heights  are  arranged  in  1-inch 
groups,  the  standard  deviation  being  only  2.54  inches,  while  the  finger  lengths 
were  originally  grouped  in  millimeters,  but  unfortunately  I  did  not  at  the  time 
see  the  importance  of  having  a  smaller  unit  and  condensed  them  into  2-milli- 
meter groups,  in  terms  of  which  the  standard  deviation  is  2.74. 

Several  curious  results  follow  from  taking  samples  of  4  from  material  disposed 
in  such  wide  groups.     The  following  points  may  be  noticed: 

(1)  The  means  only  occur  as  multiples  of  0.25. 

(2)  The  standard  deviations  f  occur  as  the  square  roots  of  the  following  types 
of  numbers:  n,  n  +  0.19,  n  +  0.25,  n  +  0.50,  n  +  0.69,  2n  +  0.75. 

(3)  A  standard  deviation  belonging  to  one  of  these  groups  can  only  be  asso- 
ciated with  a  mean  of  a  particular  kind;  thus  a  standard  deviation  of  y/2  can 
only  occur  if  the  mean  differs  by  a  whole  number  from  the  group  we  take  as 
origin,  while  \/l.69  will  only  occur  when  the  mean  is  at  n  ±  0.25. 

(4)  All  the  four  individuals  of  the  sample  will  occasionally  come  from  the 
same  group,  giving  a  zero  value  for  the  standard  deviation.  Now  this  leads  to 
an  infinite  value  of  z %  and  is  clearly  due  to  too  wide  a  grouping,  for  although  two 

*  Student,  "The  Probable  Error  of  a  Mean,"  Sec.  IV.  (For  full  citations,  see 
Sec.  7.) 

f  Here  Student  uses  the  term  "standard  deviation"  to  refer  to  the  root  mean 
square  of  the  deviations  of  the  four  observations  in  a  sample  from  their  mean. 

%  Here  Student  uses  z  to  refer  to  the  deviation  of  the  sample  mean  from  the  popula- 
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men  may  have  the  same  height  when  measured  by  inches,  yet  the  finer  the 
measurements  the  more  seldom  will  they  be  identical,  till  finally  the  chance  that 
four  men  will  have  exactly  the  same  height  is  infinitely  small.  If  we  had  smaller 
grouping  the  zero  values  of  the  standard  deviation  might  be  expected  to  increase, 
and  a  similar  consideration  will  show  that  the  smaller  values  of  the  standard 
deviation  would  also  be  likely  to  increase,  such  as  0.436,  when  3  fall  in  one  group 
and  1  in  an  adjacent  group,  or  0.50  when  2  fall  in  two  adjacent  groups.  On  the 
other  hand,  when  the  individuals  of  the  sample  lie  far  apart,  the  argument  of 
Sheppard's  correction  will  apply,  the  real  value  of  the  standard  deviation  being 
more  likely  to  be  smaller  than  that  found  owing  to  the  frequency  in  any  group 
being  greater  on  the  side  nearer  the  mode. 

These  two  effects  of  grouping  will  tend  to  neutralize  each  other  in  their  effect 
on  the  mean  value  of  the  standard  deviation  but  both  will  increase  the  variability. 

Accordingly,  we  find  that  the  mean  value  of  the  standard  deviation  is  quite 
close  to  that  calculated,  while  in  each  case  the  variability  is  sensibly  greater. 
The  fit  of  the  curve  is  not  good,  both  for  this  reason  and  because  the  frequency  is 
not  evenly  distributed  owing  to  effects  (2)  and  (3)  of  grouping.  On  the  other 
hand,  the  fit  of  the  curve  giving  the  frequency  of  z  is  very  good,  and  as  that  is  the 
only  practical  point  the  comparison  may  be  considered  satisfactory. 

The  Statistical  Research  Group  received  various  sets  of  data  that 
exhibited  curious  statistical  properties.  In  some  instances  it  was  found 
that  these  properties  were  attributable  to  the  coarse  rounding  employed 
in  recording  the  data.  In  at  least  one  instance  the  peculiarities  were 
attributable  to  the  fact  that  a  fraction  of  the  data  had  been  recorded  with 
half  the  precision  of  the  remainder,  owTing  (in  part)  to  intrinsic  differences 
in  the  precisions  of  the  two  machines  used  to  obtain  the  measurements. 
Occasionally  the  Group  was  asked  to  make  recommendations  with 
regard  to  the  recording  of  data,  since  its  investigations  had  shown  that 
in  many  cases  the  trouble  arose  in  connection  with  recording  the  data 
and  not  in  the  actual  reading  of  the  scales.  As  a  consequence  of  difficul- 
ties encountered  in  analysing  coarsely  rounded  data  and  of  requests  for 
advice  on  rounding  in  general,  a  study  of  rounding  was  undertaken, 
principally  from  the  viewpoint  of  its  effect  on  the  standard  procedures 
of  statistical  analysis.  The  results  of  this  study  are  summarized  in  this 
chapter.  Much  of  the  material  presented  here  can  be  found  in  the  litera- 
ture of  statistics,  but  it  is  widely  scattered. 

2.  ROUNDED  NUMBERS  AND  SCALE  READINGS 

Rounded  numbers  enter  the  practical  man's  world  through  two  differ- 
ent doorways.  First,  the  measurements  of  a  scientist  or  engineer  are 
rounded  numbers,  since  all  measuring  equipment  is  of  limited  accuracy. 
Second,  data  are  often  recorded  to  less  accuracy  than  is  attainable  with 
the    available   equipment,    the    accuracy   preserved   being   more   easily 

tion  mean,  divided  by  the  standard  deviation  of  the  sample,  the  latter  defined  as  above 
(see  Sec.  4.4  below). 
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attained  and  " adequate  for  the  purposes  at  hand";  or  they  are  rounded 
in  computing,  sometimes  to  simplify  arithmetical  calculations,  as  when 
0.40  is  used  in  place  of  51/128  (0.3984375  is  exact),  and  sometimes 
because  it  cannot  be  avoided,  as  when  3.1416  is  used  in  place  of  7r.* 

In  the  case  of  the  computer's  reduction  of  exact  numbers  to  rounded 
numbers  in  decimal  form,  assuming  that  he  has  rounded  them  correctly 
in  accordance  with  the  rule  given  in  the  next  section,  it  can  be  stated 
that  each  is  accurate  within  }/%  unit  in  the  last  decimal  place  retained.  In 
the  case  of  the  scientist's  or  the  engineer's  measurements  this  may  not  be 
true,  and  yet  the  last  figures  retained  may  contain  worth-while  informa- 
tion, in  the  sense  that  when  the  last  figures  are  retained  the  data  are 
more  accurate  than  when  the  last  figures  are  omitted  entirely  or  are  used 
as  a  basis  for  rounding  the  results  to  one  less  place.  In  such  cases,  if 
the  measurements  are  all  of  comparable  accuracy,  there  will  exist  a 
"unit"  of  measurement,  with  respect  to  which  the  uncertainty  of  a 
given  measurement  is  }/%  a  unit. 

Consider,  for  example,  the  case  of  an  experimenter  taking  " readings" 
on  a  scale  with  graduations  every  }£  in.  He  can  read  his  determination 
to  the  nearest  }/%  in.  directly  and  may  then  visually  estimate  it  to  the 
nearest  }{q  in.,  obtaining,  say,  a  reading  of  3J^6  in.  If  he  writes  this  out 
in  decimal  form  it  becomes  3.4375  in.,  and  represents  some  number 
between  3.40625  in.  (3%  in.)  and  3.46875  in.  (3%  in.),  and  this  is 
accurate  only  to  within  +0.03125  in.  and  not  to  within  0.00005  in.,  as  the 
number  of  decimal  places  retained  might  suggest.  He  may  decide, 
therefore,  to  report  the  value  as  3.44,  which  is  accurate  only  to  within 
+0.02875  and  —0.03375,  but  which  will  be  closer  to  the  true  value  than 
is'  3.4,  unless  the  true  value  lies  between  3.40625  and  3.42000.  When  so 
reported,  however,  his  readings  will  not  give  a  clear  indication  of  their 
accuracy.  If  an  unambiguous  indication  of  their  accuracy  is  important, 
he  should  report  his  readings  in  units  of  }4>  in.,  in  which  event  they  will  be 
accurate  to  within  plus  or  minus  J^  unit. 

3.  THE  ROUNDING  RULE  AND  ITS  PROPERTIES 

When  a  number  expressed  in  the  decimal  system  is  to  be  rounded 
to  a  specific  number  of  significant  figures, f  the  rounding  should  always  be 
carried  out  in  accordance  with  the  following  rule: 

*  The  mathematician  has  devised  convenient  symbols  for  certain  very  practical 
quantities,  such  as  the  diagonal  of  a  square  of  unit  side  (\/2)  and  the  ratio  of  the 
circumference  of  a  circle  to  its  diameter  (w),  which  cannot  be  expressed  exactly  as 
common  fractions  or  as  decimal  numbers  with  a  finite  number  of  decimals.  By 
using  the  mathematician's  symbols  for  these  quantities,  the  practical  man  can  usually 
keep  his  algebraic  manipulations  simple,  but  sooner  or  later  there  comes  a  point  at 
which  a  numerical  answer  is  needed,  and  here  it  becomes  necessary  to  replace  it 
(3.14159  26535  89793  23846+)  with  a  common  fraction,  such  as  355/113  (which  is 
correct  to  6  decimal  places),  or  with  a  decimal  approximation  such  as  3.141593. 

f  A  significant  figure  of  a  number  (expressed  in  the  decimal  system  of  notation) 


190  TECHNIQUES  OF  STATISTICAL  ANALYSIS  Chap.  4 

Rounding  Rule: 

a.  If  the  first  digit  to  be  dropped  is  less  than  5,  the  last  digit  retained 
shall  be  left  unchanged. 

b.  If  the  first  digit  to  be  dropped  is  greater  than  5,  or  is  a  5  followed 
by  digits  greater  than  0,  the  last  digit  retained  shall  be  increased  by  1. 

c.  If  the  first  digit  to  be  dropped  is  a  5  alone,  or  a  5  followed  by  0's 
only,  the  last  digit  retained  shall  be  rounded  to  the  nearest  even  number. 
Thus,  rounding  to  2  decimal  places, 

7T  =  3.141593  yields  3.14 

e  =  2.718282  yields  2.72 

1/22  =  0.045455  yields  0.05 

1/200  =  0.005000  yields  0.00 

1/198  =  0.005051  yields  0.01 

10/2,247  =  0.004450  yields  0.00 

It  is  clear  that  to  every  number  that  is  to  be  " rounded  down"  in 
accordance  with  part  (a)  of  the  rule  there  corresponds  a  number  that  is 
to  be  " rounded  up"  in  accordance  with  part  (6).  Parts  (a)  and  (6)  thus 
establish  a  balance  between  rounding  down  and  rounding  up  for  all 
numbers  other  than  those  that  are  exactly  5  in  the  first  place  to  be 
dropped.  Part  (c)  establishes  a  similar  balance  between  rounding  down 
and  rounding  up  for  numbers  that  are  exactly  5  in  the  first  place  to  be 
dropped,  by  specifying  that  the  rounding  shall  be  up  when  the  preceding 
digit  is  1,  3,  5,  7,  and  9 ;  down,  when  it  is  0,  2,  4,  6,  8.  Part  (c)  is  arbitrary, 
since  rounding  to  the  nearest  odd  number  would  establish  a  balance  in 
these  cases  equally  well,  being  equivalent  to  merely  interchanging  the 
words  up  and  down  in  the  preceding  sentence.  However,  uniformity  of 
procedure  is  important  for  checking,  and  consistency  of  published  tables  is 
desirable,  so  part  (c),  being  well  established,  should  be  adhered  to. 

A  question  that  arises  naturally  at  this  point  is:  Will  repeated  applica- 
tions of  the  rule  (dropping  digits  singly,  or  in  pairs,  or  otherwise)  always 
lead  to  the  same  result,  when  the  rounding  is  stopped  at  a  specified  place, 
as  a  single  rounding  to  that  place?  The  answer,  unfortunately,  is  No. 
The  fifth  example  in  the  list  above  is  a  case  in  point.  If  0.005051  is 
rounded  to  2  decimal  places  in  accordance  with  the  rule  in  almost  any 
manner  involving  more  than  one  step,  the  result  is  0.00,  which  is  " incor- 
rect/' The  third  example  may  be  used  to  illustrate  the  fact  that  the 
number  of  places  to  which  a  number  is  originally  tabulated  may  affect 
the  outcome  of  subsequent  rounding:  }^2  =  0-0  45  45  45  45  ad  infinitum; 

is  any  digit  that  contributes  to  the  specification  of  the  magnitude  of  the  number  aside 
from  serving  to  locate  the  decimal  point.  For  example,  345  and  0.000345  each  have 
three  significant  figures;  34.05  and  34.50  (as  distinct  from  34.5)  each  have  four 
significant  figures;  34500  has  3,  4,  or  5  according  as  it  means  3.45  X  105,  3.450  X  106, 
or  3.4500  X  105,  respectively. 
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correct  to  6  decimal  places  it  equals  0.045  455,  the  value  given  in  the 
table  above;  if  this  latter  value  is  rounded  to  5  decimals  in  accordance 
with  the  rule,  the  result  is  0.045  46;  however  ^2  equals  0.045  45  correct  to 
5  decimals,  the  result  that  will  be  obtained  by  rounding  to  5  decimals 
when  the  value  originally  tabulated  is  correct  to  7  decimals  or  more. 

It  is  clear  from  the  foregoing  that,  when  an  n-place  table  is  prepared  by 
rounding  the  entries  given  in  an  (n  +  /c)-place  table,  some  of  the  entries 
may  not  be  correct  in  the  nth  place.  Thus,  in  a  table  of  loge  N  to  5 
decimal  places,  the  5-place  value  given  for  loge  1947  =  7.57404  50054 
will  be  incorrect  in  the  fifth  place  if  obtained  by  rounding  from  less  than 
an  8-place  table  with  entries  correct  in  the  eighth  place.  Final  5's  can, 
however,  be  marked  to  show  whether  further  rounding  should  be  up 
or  down;  see,  for  example,  Tables  1.1  and  1.5  of  Chap.  1,  in  which  each 
final  5  is  italicized  if  rounding  should  not  be  in  accordance  with  part  (c) 
of  the  rule. 

It  may  be  asked  whether  there  exists  a  rounding  rule  that  will  always 
lead  to  the  same  result  at  a  specified  place,  irrespective  of  the  number  of 
applications  of  the  rule  employed  in  rounding  to  that  place.  A  procedure 
suggested  by  Cayley*  possesses  this  advantage.  According  to  this 
procedure,  the  figure  in  the  last  place  to  be  retained  is  not  adjusted  at 
all — which  is  equivalent  to  rounding  down  irrespective  of  the  digits  that 
follow — with  the  result  that  the  number  recorded  either  is  exact  or  errs 
in  defect.  If,  when  a  recorded  number  of  this  type  is  used,  a  final  5  is 
affixed  to  it,  then  the  error  will  lie  between  plus  and  minus  ^  unit  in  the 
last  place  recorded,  as  is  the  case  when  the  number  recorded  is  rounded 
in  a  single  step  from  its  exact  value  in  accordance  with  the  Rounding 
Rule.  For  example,  w  would  be  recorded  to  4  decimals  as  3.1415,  and 
would  be  taken  to  be  3.14155  in  further  calculations.  Whereas  Cayley's 
procedure  has  the  advantages  that  it  is  simple  (since  numbers  are  always 
rounded  down),  that  it  can  be  mechanized  for  purposes  of  machine 
computation,  and  that  it  yields  an  n-place  recorded  number  that  is 
independent  of  the  number  of  rounding  steps  involved  in  reaching  it  from 
the  exact  value,  it  has  the  disadvantage  that  it  requires  the  use  of  (n  +  1)- 
place  numbers  in  further  computations  in  order  to  attain  the  same 
degree  of  accuracy  that  would  be  obtained  by  using  n-place  numbers 
rounded  in  accordance  with  the  Rounding  Rule. 

4.  EFFECTS  OF  ROUNDING  ON  STATISTICAL  ANALYSES 
OF  LARGE  NUMBERS  OF  OBSERVATIONS 

4.1.  Nature  of  the  Effects 

Frequently  a  statistician  analyzes  measurements  recorded  to  several 
decimal  places  more  than  can  possibly  have  any  practical  significance. 
*  Cayley,  letter  to  J.  W.  L.  Glaisher. 
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The  extra  decimal  places  do  not  occasion  any  great  concern,  for  nothing 
has  been  done  that  cannot  be  undone.  Recording  the  extra  digits  and 
rounding  the  measurements  are  merely  unnecessary  nuisances.  On  the 
other  hand,  measurements  are  sometimes  recorded  to  the  nearest  5,  10, 
20,  50,  or  100  units,  or  still  more  coarsely,  when  they  could,  and  should, 
have  been  recorded  to  the  nearest  unit,  or  even  more  accurately.  Thus,  if 
duplicate  measurements  taken  under  essentially  the  same  conditions  tend 
to  differ  on  the  average  by  about  50  units,  research  workers  often  record 
the  measurements  to  the  nearest  10  or  20  units;  sometimes  only  to  the 
nearest  50  units.  If  measurements  taken  under  essentially  the  same 
conditions  tend  to  be  distributed  approximately  normally  about  a 
"true  value"  with  a  standard  deviation  of  a,  these  roundings  correspond 
to  intervals  with  widths  roughly  0.23a,  0.45a-,  and  1.1a,  respectively.* 
Statisticians,  on  the  other  hand,  recommend  that  the  width  of  the  round- 
ing interval  be  less  than  one-third,  or,  better,  less  than  one-fourth  the 
standard  deviation  of  random  sampling,  f 

Basically,  the  choice  of  the  size  of  a  rounding  or  grouping  interval 
depends  on  the  answers  to  the  following  three  questions  :- 

a.  If  the  observations  Xi,  x2,  .  .  .  ,  xN  are  to  be  regarded  as  inde- 
pendent values  of  a  random  variable  x,  which,  in  the  population  from 
which  the  data  are  drawn,  is  distributed  about  a  mean  of  /jl  with  a  stand- 
ard deviation  a,  then  what  is  the  effect  of  the  rounding  (grouping)  on  the 
sample  estimates  of  /jl  and  a,  defined  by 

N 


■  =1 

N 


*  =  ^  (1) 


and 


(2) 

b.  What  is  the  effect  of  the  rounding  on  confidence  limits  for  /jl  and  a 
based  on  x  and  s? 

c.  What  is  the  effect  of  the  rounding  on  statistical  tests  of  hypotheses 
about  ju  and  a  based  on  x  and  s? 

In  the  present  section,  answers  to  these  questions  will  be  given  for 
the  case  when  the  number  of  observations  N  is  large.  In  Sec.  5  the  case 
where  N  is  small,  perhaps  only  2  or  3,  is  discussed;  the  results  given  there 

*  The  average  difference  between  duplicate  measurements  is,  under  the  assump- 
tions stated,  2<r/-\/^r  =  1.12838a.  If  this  average  difference  is  50,  a  grouping  interval 
of  10  corresponds  with  1.12838o-/5  =  0.226a,  and  similarly  for  a  grouping  interval 
of  20. 

f  See,  for  example,  Fisher,  Statistical  Methods,  Sec.  13. 
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throw  light  on  what  values  of  N  may  be  regarded  as  large  and  what  values 
must  be  regarded  as  small. 

4.2.  Sheppard's  Corrections 

In  order  to  answer  the  first  question,  consider  the  relation  between 
the  kth  (population)  moment 

li'k  =  E(xk)  =  |+j°  xkp(x)  dx         (k  =  1,  2,  3,  4,  •  •  •  )  (3) 

of  a  continuous  random  variable  distributed  according  to  the  probability 
density  function*  p(x),  and  the  kth  moment 

v'k  =     I     ^  CJ  m  dt         {k  ='  1;  2,  3,  4,  •  •  •  )  (4) 

i=  -  oo  l      2 

of  the  histogram  formed  by  grouping  the  values  of  x  into  classes  of  width 
w  with  mid-points  Xi,  (i  =  0,  ±1,  ±2,  •  •  •  ).  The  ^  are,  by  definition, 
the  moments  (about  the  origin)  of  x  in  the  population,  in  which  x  is  a 
continuous  variable.  On  the  other  hand,  since  it  is  not  possible  either 
to  read  or  to  record  with  perfect  accuracy  all  values  of  a  continuous 
variable,  recorded  values  of  x  will  generally  be  rounded  values,  any  value 
of  x  between  Xi  —  (w/2)  and  x%  +  (w/2)  being  recorded  as  xi.  Hence,  in 
a  sense,  the  v'k  are  realistic  moments  of  x  and  the  /4  are  ideal  moments. 

If,  first,  p(x)  can  be  expanded  in  a  Taylor  series  throughout  the  range 
for  which  p(x)  >  0,  second,  xkp(2n)(x)  is  finite  and  continuous  throughout 
this  range  (k  =  1,  2,  3,  4,  •  •  •  ;  n  =  1,  2,  3,  4,  •  •  •  ),  and  third,  p{x) 
and  p(n)(x)  (n  =  1,  2,  •  •  •  )  vanish  at  the  limits  of  this  range,  then 
the  yk  and  the  /*£  are  related  as  follows: 

v[  =  Mi 


*'       '  J-  w 
^2  =  M2  -r  Y2 

VZ    =    M3   +   \^Ty 

1  Mi 

^4    =    M4  +   [j/ 

1     '  J-  ™4 

(5) 


2  WD" 


M/c— 2i 


",  =   >    |CM£)    (2»-+.l) 

0 

where  CJ  =  k\/[r\(k  -  r)\]  for  0  <  r  <  k,  and  Ckr  =  0  for  r  >  fc.f 

*  That  is,  for  any  value  of  c,  the  integral  P(c)  =   I        p(x)  dx  equals  the  probability 

that  x  is  less  than  c,  0  <  P(c)  <  1  for  all  values  of  c,  P(-  oo  )  =0,  andP(+oo)  =  1. 

f  Sheppard,  "Most  Probable  Values  of  Frequency  Constants."  The  correspond- 
ing case  of  two  variables  has  been  considered  by  Baten,  "Corrections  for  the 
Moments,"  and  of  several  variables  by  Wold,  "Sheppard's  Correction  Formulae." 
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The  three  conditions  regarding  p(x)  imply  that  it  is  not  only  con- 
tinuous over  the  range  for  which  p(x)  >  0  but  tapers  off  strongly  to  zero 
at  the  ends  of  this  range ;  they  thus  exclude  probability  curves  with 
abrupt  ends,  such  as  a  uniform  distribution  over  —w/2  to  +w/2,  and 
^/-shaped  or  J-shaped  curves.  The  three  conditions  are  necessary  and 
sufficient  for  the  relations  (5)  to  hold  exactly.  Relations  analogous  to 
(5),  with  additional  terms,  have  been  obtained  for  cases  excluded  by  the 
above  assumptions.* 

The  relations  (5)  can  be  derived  in  a  somewhat  more  general  manner 
that  shows  that  they  are  valid  on  the  average  whenever  the  moments 
involved  exist,  provided  that  the  rounding  lattice  is  imposed  at  random 
on  the  underlying  distribution,  f  Suppose  the  x  axis  is  first  divided  in  a 
definite  manner  into  an  infinite  number  of  class  intervals  of  width  w/m, 
where  m  is  some  integer  greater  than  1.  Given  a  probability  density 
function  p(x),  the  histogram  corresponding  to  this  grouping  of  the  values 
of  x  can  always  be  determined  from 


p(Xj) 


■Xj  + 


/      A  \   Vif)  dt 


(j  =  0,  ±1,  +2,  • 


) 


(6) 


where  Xj  is  the  mid-point  of  the  jth  subdivision,  since  the  integrals 
involved  will  always  exist  when  p(x)  is  a  probability  density  function. 
The  moments  about  the  origin  of  this  basic  histogram,  /4',  say,  can  be 
determined  from  the  summation  analogous  to  (4),  so  far  as  they  exist 
(that  is,  are  finite).  Corresponding  to  the  basic  histogram  determined 
by  (6),  m  new  histograms  can  be  formed  by  combining  the  successive 
subclasses  of  width  w/m  into  subclasses  of  width  w,  the  intervals  con- 
taining Xo,  Xi,  ...  ,  xm-i  being  combined  in  one  instance,  the  intervals 
containing  xh  x2,  .  .  .  ,  xm  being  combined  in  the  next  instance,  and 
so  forth.  Corresponding  to  each  of  these  histograms,  moments  about  the 
origin,  fJ(s),  s  =  1,  2,  •  •  •  ,  m,  can  be  determined  from  summations 
analogous  to  (4),  so  far  as  they  exist.  Let  v'k  denote  the  average  of  the 
values  of  v'h(s),  s  =  1,  2,  •  •  •  ,  m,  then  the  v'h  and  the  /*Jf  are  related  as 
follows,  in  so  far  as  they  exist: 
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240 


*  See  papers  by  Elderton,  Pairman  and  Pearson,  Pearse,  and  Pearson  listed  in 
Sec.  7  below. 

f  See  the  papers  by  Abernathy,  Craig,  and  Carver  listed  in  Sec.  7. 
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and  so  forth.  Letting  m  — »  oo ,  the  right-hand  sides  of  these  equations 
tend  to  the  corresponding  expressions  in  equations  (5),  since  /4'  — ■»  ^  as 
m  — >  oo .  On  the  other  hand,  the  v'k  retain  their  meanings  as  average 
values  of  the  vk.  Accordingly,  whatever  the  nature  of  the  distribution 
of  x,  equations  (5)  hold  on  the  average,  that  is,  they  give  the  average 
relationships  between  the  vk  and  the  ^k,  provided  these  are  finite,  for  an 
infinite  series  of  trials  in  which  a  rounding  lattice  composed  of  sections  of 
width  w  is  imposed  at  random  on  the  underlying  probability  distribution. 
The  results  given  above  all  refer  to  moments  about  a  fixed  point 
taken  as  the  origin,  and  the  values  of  the  moments  involved  will  depend 
upon  the  choice  of  the  origin.  Thus,  if  x  denotes  temperature  and  is 
measured  in  degrees  Fahrenheit,  the  values  of  the  n'k  and  vk  will  be  differ- 
ent from  those  obtained  when  x  is  measured  in  degrees  centigrade.  The 
results  obtained  will  be  the  same,  however,  if  the  origin  is  chosen  at 
the  mean  of  the  distribution  of  x.  Dropping  the  primes  to  signify 
moments  about  the  mean  of  x,  relations  (5)  correspond  with 
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and  so  forth. 

Comparison  of  the  second  equation  of  (8)  with  (22)  shows  that  the 
variance  (second  moment  about  the  mean)  of  a  rounded  value  of  x,  which 
may  be  denoted  by  xR  for  convenience,  equals  the  variance  of  x,  namely  /jl2, 
plus  a  variance  w2/12,  which  equals  the  variance  of  a  rounding  error  e, 
which  is  distributed  uniformly  between  —  w/2  and  +  w/2.  This  suggests 
that  we  may  write 

xR  =  X  +  €  (9) 

where  xB  denotes  a  value  of  x  rounded  to  within  ±w/2,  and  €  is  a  random 
variable  independently  distributed  with  uniform  probability  density 
between  —w/2  and  -\-w/2.  That  the  representation  (9)  is  strictly  valid 
under  the  three  conditions  stated  above,  and  that  it  is  valid  on  the  average, 
in  the  above  sense,  under  any  conditions  for  which  the  moments  exist, 
has  been  demonstrated  rigorously.  * 

Since  it  is  usually  desired  to  determine  the  fik  from  the  vk,  equations 
(8)  are  customarily  solved  for  the  pk  in  terms  of  the  vk  and  powers  of  w, 
and  the  relations  given  as 

*  See  papers  by  Craig  and  by  Langdon  and  Ore,  listed  in  Sec.  7. 
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(10) 
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v± 

w2 
2"' 

^2 

+ 

7w4 
240 

and  so  forth.* 
These  adjustments  of  the  vk  to  obtain  the  nk  are  customarily  called 
Sheppard's  corrections  for  grouping.  As  has  been  indicated  above,  they 
are  valid  as  average  adjustments  whenever  the  moments  involved  exist, 
provided  that  the  rounding  lattice  is  imposed  at  random  on  the  under- 
lying distribution.  This  latter  provision  means,  for  example,  that  when 
values  of  a  variable  that  can  take  only  nonnegative  values  are  to  be 
rounded,  zero  shall  not  inevitably  be  adopted  as  the  lower  extremity  of 
the  lowest  class — indeed,  the  mid-point  of  the  lowest  class  may  turn 
out  to  be  a  small  negative  number.  If  the  rounding  lattice  is  not  imposed 
at  random  when  Sheppard's  original  assumptions  are  not  fulfilled, 
then  the  foregoing  adjustments  may  be  wildly  wrong  and  may  even 
adjust  in  the  wrong  direction,  f 

4.3.  Comparison  of  the  Distribution  of  the  Mean  of  N  Rounded  Observa- 
tions and  the  Distribution  of  the  Mean  of  N  Unrounded  Obser- 
vations, When  N  Is  Large 

Sheppard's  corrections  make  possible  a  concise  comparison  of  the 
distribution  of  the  mean  of  N  rounded  observations  and  the  distribution 
of  the  mean  of  N  unrounded  observations,  when  N  is  large.  Continuing 
the  notation  adopted  in  connection  with  (9),  let 


I  w  (11) 


Xr   = 

i  =  l 

where  xRi  is  the  rounded  value  of  xi}  the  ith  of  N  independent  values  of  a 
random  variable  x,  corresponding  to  a  rounding  lattice  with  classes  of 

*  The  corresponding  relations  for  ju5  to  ju8  are  given  in  Rietz,  Handbook,  p.  94. 
They  can  be  obtained  by  letting  m  — >  «  in  the  expressions  for  ^  to  fx8  given  by  Craig. 
Letting  A*  and  Lk  (k  =  ,1,  2,  •  •  •  )  denote  the  semi-invariants  of  the  unrounded 
variable  x  and  the  rounded  variable  xR,  respectively,  the  corresponding  relations 
become  simply 

X2m  =  L2m  +  (-i)^-^ 

X2m+1    =   iv2m+l 

where  Bm  is  the  mth  Bernoulli  number.  Thus,  X4  =  L4  +  w4/120,  x6  =  L6  -  w*/252, 
X8  =  L8  +  w8/240,  and  so  forth.  For  further  details,  see  the  papers  by  Craig, 
Langdon  and  Ore,  and  Wold. 

f  The  reader  who  wishes  to  delve  further  into  the  mathematical  background  of 
Sheppard's  corrections  is  urged  to  read  Kendall's  paper. 
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width  w  (located  at  random  on  the  x  axis,  if  Sheppard's  assumptions  are 
not  fulfilled).     Furthermore,  let 

the  mean  of  x  in  the  population  =  n[.x 

the  variance  of  x  in  the  population  =  v\  —  fx2:x     (12) 

the  Arth  moment  of  x  about  its  population  mean  =  fxk.x 

(k  =  1,  2,   ■  •  ■  ) 
and  let 

w  =  a  •  ax  (13) 

Table  4.1  presents  a  comparison  of  the  distribution  of  xR  in  regard  to 
mean,  variance,  skewness,  and  kurtosis  with  the  distribution  of  x  and 
with  a  normal  distribution  of  mean  ,uj  and  variance  o-2. 
From  a  study  of  Table  4.1  it  is  apparent  that 

(1)  x  and  xR  are  each  unbiased  estimators  of  the  population  mean  of 
x. 

(2)  The  variance  of  xR  is  (1  +  a2/ 12)  times  the  variance  of  x. 

(3)  The  skewness  (71  =  +  Vhi  of  Pearson's  notation)  of  the  dis- 
tribution of  xR  never  exceeds  that  of  the  distribution  of  x. 

(4)  The  kurtosis  (72  =  /32  —  3  of  Pearson's  notation)  of  the  distribu- 
tion of  xR  is  always  less  than  that  of  the  distribution  of  x. 

(5)  When  N  is  large,  x  and  xR  are  approximately  normally  distributed 
about  the  population  mean  of  x,  fx[:x. 

The  case  in  which  the  ungrouped  variable  x  is  normally  distributed 
is  of  special  interest.  In  this  case  x  is  likewise  normally  distributed, 
whereas  the  distribution  of  xR  is  symmetric  but  has  slightly  less  kurtosis 
than  a  normal  distribution,  the  deficiency  being  (a4/12(W)/(l  +  a2/12)2. 
From  the  Gram-Charlier  series  for  the  " generalized  normal  curve,"* 
it  is  seen  that  this  deficiency  in  kurtosis  implies  that  the  distri- 
bution of  xR  has  slightly  less  area  in  the  " central  region,"  say,  between 
±0.75aZR,  slightly  less  also  in  the  "tails,"  say,  beyond  ±2.3^,  and 
slightly  more  between  these  points.  For  any  practical  rounding,  say, 
a  =  w/a-  <  1,  this  effect  will  surely  be  negligible.  R.  A.  Fisher  has 
shown  f  that  the  maximum  likelihood  estimate  A  of  n'1:x  from  N  inde- 
pendent rounded  values  xR  of  a  normally  distributed  variable  x  has  a 
variance  of  (1  +  a2/l2  —  a6/2,880  _)_...).  crl/N.  The  loss  of  precision 
when  xR  is  used  instead  of  /x  is,  therefore,  only  a6/2,880,  which  is  clearly 
negligible  for  any  practical  rounding. 

In  conclusion 

a.  xr  is  an  unbiased  estimator  of  the  population  mean  of  x,  /j.'1:x, 
whatever  the  value  of  N. 

b.  When  N  is  large  and  the  values  of  al  and  w  are  known,  so  that 
<t2Xr  =  (o-2.  +  w2/12)/N    can   be    evaluated,    confidence    limits  for,   and 

*  See,  for  instance,  Rietz,  Handbook,  p.  114. 
t  Fisher,  " Mathematical  Foundations,"  p.  362. 
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statistical  tests  of  hypotheses  regarding  the  value  of  /jl[:x  carried  out 
with  (xR  —  ix[:x)/<nB  regarded  as  a  normal  deviate  will  be  approximately 
correct — provided  the  true  distribution  of  x  has  "high  order  contact" 
(that  is,  Sheppard's  assumptions  are  valid),  or  the  rounding  lattice  is 
located  at  random  with  respect  to  the  admissible  values  of  x,  or  both. 


TABLE  4.1 

Comparison  of  the  Mean,  Variance,  Skewness,  and  Kurtosis  of  the 
Distribution  of  the  Average  of  N  Independent  Rounded  Values  of  a  Random 
Variable  with  the  Corresponding  Characteristics  of  the  Distribution  of  the 
Unrounded  Variable  and  of  a  Normal  (Gaussian)   Distribution  When  the 
Width  of  the  Rounding  Interval  Is  a  Standard  Deviation  Units  (w  =  a  •  a) 


Characteristic 


Mean .  .  . 

Variance 


Skewness  (71). 


Kurtosis  (72) 


Distribution 


Unrounded 
variable,  x 


Mi: 


71: 


O"4 


Average  of  N 
rounded  values,  xR 


I--V.X 


('♦£) 


N 


yv.x 


M'-i)' 


a4 
120 


(•>i)' 


V 


Normal 


Mi 


4.4.  Comparison  of  the  Distribution  of  s2  for  Rounded  and  Unrounded 
Observations,  When  the  Number  of  Observations  Is  Large 

Let 


1 


(Xrj   —   Xr)2 

N  -  1 


(14) 


where  xRi  (i  =  1,  2,  •  •  •  ,  N)  and  xR  are  as  in  (11).  In  Table  4.2  the 
mean  of  si  and  the  ratio  of  the  variance  of  s|r  to  the  square  of  its  mean 
are  compared  with  the  mean  and  corresponding  ratio  for  s2x  under  two 
sets  of  conditions:  first,  when  x  has  any  distribution  for  which  the 
first  four  moments  exist;  second,  when  x  is  normally  distributed. 
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It  is  evident  from  a  study  of  Table  4.2  that  the  ratio 

(variance  of  s2)/(mean  of  s2)2 

is  always  less  in  the  case  of  s%  than  in  the  case  of  $*.  The  difference 
is  slight,  however,  and  for  any  reasonable  rounding,  say,  a  =  w/a  <  1, 
will  generally  be  negligible.     Therefore 

(1)  When  N  is  large,  (N  -  l)s2xJ(a2x  +  w2/12)  and  (N  -  l)s2x/a2x  will 
be  distributed  as  x2  with  N  .—  1  degrees  of  freedom  to  about  the  same 

order  of  approximation. 

TABLE  4.2 
The  Mean  of  s2.   and  of  the  Ratio  of  the  Variance  of  s2x    to  the  Square  of  the 

XR  XR 

Mean  Compared  with  the  Corresponding  Quantities  for  s*,  When  the 
Distribution  of  x  Is  Arbitrary*  and  When  the  Distribution  of  x  Is  Normal 


(-9 


Distribution  of  x 

Arbitrary 

Normal 

Statistic 

*: 

si 

XR 

si 

Mean        

2               72:, 

N  -  1  r   N 

a4 
2            72:x        120 
N  -  1    '           N 

d 

Variance 

2 

(Mean)2 

N  -  1 

*  Subject  to  the  restriction  that  the  first  four  moments  of  x  about  its  population  mean  exist. 

(2)  The  quantity  s|  —  w2/l 2  is  an  unbiased  estimator  of  the  variance 
of  x  in  the  population  u2x1  whatever  the  value  of  N,  provided  the  distribu- 
tion of  x  has  high  order  contact  (that  is,  Sheppard's  three  assumptions, 
stated  in  Sec.  4.2,  are  fulfilled),  or  the  rounding  lattice  is  located  at 
random  with  respect  to  the  admissible  values  of  x,  or  both.  * 

It  may  be  well  to  point  out  here  that  the  ratio 

(At  -  l)(sls  -  w*/lZ)/4 

should  not  be  used  as  a  substitute  for  the  ratio  given  in  (1)  unless  a  =  w/cr 
is  quite  small,  since  the  variance  of  this  ratio  exceeds  the  variance  of  the 
ratio  given  in  (1)  by  a  factor  of  (1  +  a2/12)2,  which  will  generally  render 

*  When  N  is  small,  the  mean  and  variance /(mean)2  relations  shown  in  Table  4.2 
are  still  valid,  but  the  discontinuities  of  the  distribution  of  si  are  such  that  graduation 
of  the  distribution  of  si  by  a  continuous  distribution  is  virtually  impossible  (see 
Sec.  5.1). 

Kendall  has  remarked  that  (2)  gives  the  correct  estimator  of  o-2x  if  the  rounding  of 
the  values  of  x  is  determined  before  the  sample  is  drawn.  If,  on  the  other  hand,  the 
observations  xi,  X2,  .  .  .  ,  xn  are  drawn  first  and  then  rounded,  he  suggests  that 
slR  —  w2/[12(N  —  1)]  be  used  to  estimate  a\.  When  N  is  large  the  difference  is 
clearly  negligible  from  a  practical  standpoint  but  is  of  some  theoretical  interest. 


200  TECHNIQUES  OF  STATISTICAL  ANALYSIS  Chap.  4 

the  x2  approximation  less  accurate.  If  the  adjusted  numerator  and  the 
denominator  a\  +  w2/12  are  used,  then  the  variance  of  the  resulting  ratio 
will  be  the  same  as  the  variance  of  the  ratio  given  in  (1),  but  the  mean 
will  be  deficient  by  a  factor  of  1/(1  +  a2/12).  If  slR  is  used  in  the 
numerator  and  <r\  in  the  denominator,  the  mean  will  be  excessive  by  a 
factor  of  (1  +  a2/ 12)  and  the  variance  by  a  factor  of  (1  +  a2/12)2. 
Consequently, 

(3)  Determinations  of  confidence  limits  for,  and  statistical  tests  of 
hypotheses  regarding,  the  value  of  cr2  should  be  based  on  slR  as  implied 
by  (1)  and  will  be  valid  to  the  extent  that  the  conditions  and  approxima- 
tions thereof  obtain. 

In  conclusion,  it  should  be  remarked  that  the  foregoing  discussion  can 
be  carried  over  to  the  case  where  s2  has  been  obtained  from  an  analysis 
of  variance  of  values  of  x  that  have  been  rounded  to  within  ±w/2  and 
has  n  degrees  of  freedom.  The  only  change  required  is  replacement  of 
N  —  1  by  n  throughout. 

4.5.  Effect  of  Rounding  on  the  Distribution  of  Student's  t 
Let 

tx  =  V*  (*  ~  ^  (15) 


and 

VN  (xr  -  mL) 


S*R 


(16) 


where  x  and  xR  are  as  in  (1)  and  (11),  sx  and  sx  are  as  in  (2)  and  (14), 
fjLf1:x  is  the  mean  value  of  x  in  the  population,  and  N  is  the  number  of 
independent  values  of  x  (or  of  xR)  in  the  sample. 

When  the  N  independent  observations  involved  are  from  a  normal 
distribution,  the  distribution  of  tx  is  known  as  "Student's  t  distribution,"  in 
recognition  of  the  fact  that  the  distribution  of  z  =  t/\/N  —  1  under 
these  conditions  was  first  given  by  Student  in  the  epochal  paper  from 
which  the  quotation  in  Sec.  1  of  this  chapter  is  taken.  In  his  derivation, 
Student  utilized  the  independence  of  x  and  s  when  computed  from 
independent  observations  from  a  normal  population,  although  he  himself 
merely  established  the  fact  that  the  product  moment  coefficients  of 
correlation  between  x  and  s,  and  between  x2  and  s2,  are  zero  under  these 
conditions.  Independence  of  x  and  s  is  actually  a  necessary  condition. 
A  rigorous  proof  of  their  independence  in  random  samples  from  a  normal 
population  was  communicated  to  Student  by  R.  A.  Fisher  in  correspond- 
ence prior  to  the  First  World  War;  this  first  saw  light,  only  incidentally, 
as  a  preliminary  step  in  Fisher's  paper,  published  in  1915,  on  the  exact 
sampling  distribution  of  the  correlation  coefficient.*     The  first  complete 

*  Fisher,  "Distribution  of  the  Correlation  Coefficient." 
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derivation  of  Student's  distribution  was  given,  in  terms  of  t,  by  Fisher 
in  1926,*  although  he  had  given  a  more  general  definition  of  t  two  years 
earlier,  f 

As  Student  pointed  out  in  the  passage  quoted  in  Sec.  1  of  this  chapter, 
there  is  a  correlation  between  the  sample  mean  and  the  sample  standard 
deviation  in  random  samples  of  rounded  observations,  resulting  from  the 
rounding,  so  that  xr  and  sx  will  not  be  independently  distributed  even 
when  x  and  sx  are  independent.  Nevertheless,  Student's  experimental 
results  suggest  that  the  distribution  of  tx  will  closely  approximate  that 
of  tx  when  N  is  large  enough  to  render  negligible  the  discontinuities  of 
the  distribution  of  tx  .  That  this  is  so  is  evident  from  a  study  of  the 
first  four  moments  of  tx  for  the  case  of  x  having  an  arbitrary  distribution 
subject  only  to  the  condition  that  its  first  four  moments  exist.  The 
first  four  moments  of  tx  can  be  evaluated  readily,  under  this  assumption 
regarding  x,  from  the  expressions  for  the  first  four  moments  of  t  in  terms 
of  the  semi-invariants  of  x,  given  by  Geary,  f  The  first  four  moments  of 
tx  are  given  in  Table  4.3,  where  they  are  compared  with  the  first  four 
moments  of  Student's  t  distribution,  that  is,  the  moments  of  tx  when  x 
is  normally  distributed.  § 

It  is  evident  from  a  study  of  Table  4.3  that,  to  terms  of  order  1/N, 
rounding  affects  only  the  fourth  moment,  and  that  the  effect  is  clearly 
negligible  in  the  case  of  samples  from  a  normal  population  (y1:x  =  0  and 
72:2  =  0)  for  any  reasonable  rounding,  say,  a  =  w/a  <  1.  It  can  be 
seen  from  a  study  of  Geary's  more  extensive  formulas  that,  when  the 
distribution  of  x  is  symmetrical  but  not  necessarily  normal,  the  first 
and  third  moments  of  tx  are  zero  through  terms  in  1/N2  at  least  and  the 
variance  of  tx  has  its  normal  value  through  the  terms  in  1/N.  Indeed, 
when  x  is  normally  distributed,  the  variance  of  tx  has  its  normal  value 
through  terms  in  1/N2  also.     Therefore,  it  may  be  stated  that 

When  N  is  sufficiently  large  and  w  is  sufficiently  small  to  render 
the  discontinuities  of  the  distribution  of  tx  negligible,  the  distribution  of 
tx    and  tx  will  follow  the  standard  t  distribution  to  about  the  same  order 

R 

of  approximation,  provided  the  distribution  of  x  has  high  order  contact, 
or  the  rounding  lattice  is  located  at  random  with  respect  to  the  admissible 
values  of  x,  or  both. 

*  Fisher,  "  Applications  of  Student's  Distribution." 

f  Fisher,  "Error  Function  of  Well-Known  Statistics." 

J  Geary,  "Distribution  of  Student's  Ratio." 

§  The  moments  of  tx  are  given  to  terms  of  order  1/N;  that  is,  terms  in  the  %  and 
higher  powers  of  1/N  have  been  neglected.  Geary's  general  expressions  for  the  first 
four  moments  are  correct  to  terms  of  order  l/N2.  The  terms  of  order  1/N^  and 
1/iV2  for  tx  can  be  evaluated  with  the  aid  of  the  semi-invariants  of  xr  given  in  the 
footnote  to  expressions  (10)  above. 


202 


TECHNIQUES  OF  STATISTICAL  ANALYSIS 


Chap.  4 


o 

H 

Oh 

'< 


o 

Q 
H 
to 

CO     <j 

rh  pq 


w  * 
<  S 

02 
0 


0 

fe 

w 

o 

K 

<< 
ft 

P 

§ 

w 

o 

Pi 

U 

0 

£ 

-to 

w 
o 

« 

£ 

M 

<! 

p 

H 

o 

U2 

^ 

H 

r/5 

g 

fc 

h 

O 

Eh 

02 
Eh 

fc 

W 

fe 

o 

a 

H 

s 

£ 

& 

O 

81  b 


M       p 

Q      Ph 


o  << 


~k 

^— . y 

0 

3 

+ 

o 

^T^ 

+ 

+ 

<3 

a 

O 

<N  |^ 

<2^ 

H^                             O 

+ 

O 

CO 

£ 

^H 

II 

■  I 

_, N 

" 

iO 

1 

1 

Sr 

1 

5. 

^ 

^ 

1 

£ 

CO 

1 

CO 

1 

• 

3 

IB 

0 

0) 

+ 

s 

„ 

o 

£ 

^""^ 

1                     1 

o 

=+H 

H^ 

o 

*\% 

„  1  0 
1 

+ 

0Q 

o 

+ 

O 

0) 
-(J 

+ 

•* 

+ 

r- 

(35 

".3 

8 

> 

^ 

+ 

^ 

H 
^ 

> 

_H 

■+-> 

<M 

^, 

1> 

C<l 

"£ 

05 

"S 

1 

cn 

I 

2 

£ 

1 

- 

h 

1 

+ 

Ph 
Ph 

1—1 

H-i 

1                       1 

3 
Ph 

*>\>i 

< 

+ 

CO 

a 

o 

o 

3 

3. 

3. 

a. 

a 

£ 

Sec.  5.2.1  ROUNDING  DATA  203 

It  should  be  noted  that  si  ,  and  not  its  adjusted  value  given  in 
Paragraph  (2)  of  Sec.  4.4,  is  to  be  used  in  the  denominator  of  tx  .  If  the 
adjusted  value  is  used,  giving 

,  _  Vn  (xR  -  /4J  ,r~ 

W2 


i 


12 


then,  since  \tXg\  >  \tXjt\,  it  is  clear  that  significant  values  of  tx  will  arise 
too  frequently.  Furthermore,  when  slR  is  less  than  w2/12,  the  value  of 
t'x  will  be  imaginary,  and  in  such  cases  some  convention  regarding  the 
value  of  t'  to  be  recorded  will  be  needed.  In  other  words,  as  Fisher 
has  stated,  "These  [Sheppard's]  adjustments  should  be  used  for  purposes 
of  estimation,  but  not  usually  for  tests  of  significance."* 

The  foregoing  discussion  carries  over  to  the  case  where  sx  has  been 
obtained  by  an  analysis  of  variance,  or  otherwise,  from  values  of  x 
rounded  to  within  ±w/2  and  has  n  degrees  of  freedom,  provided  the 
situation  is  such  that  x  and  sj  may  be  regarded  as  independently  dis- 
tributed.    The  only  change  required  is  to  replace  N  —  1  by  n  throughout. 

5.  EFFECTS  OF  ROUNDING  ON  STATISTICAL  ANALYSES  OF 
SMALL  NUMBERS  OF  OBSERVATIONS  FROM  A  NORMAL  POPULATION 

5.1.  Introduction 

Although  the  results  obtained  above  are  independent  of  the  value  of 
N,  the  discussion  up  to  this  point  has  been  concerned  mainly  with 
cases  involving  a  large  number  of  observations.  An  exception  is  the 
quotation  from  Student  in  Sec.  1,  which  discusses  in  detail  the  effects  of  a 
coarse  rounding  on  samples  of  4  from  a  nearly  normal  population.  From 
this  passage  it  is  clear  that  in  the  case  of  small  samples  the  discontinuities 
of  the  distributions  of  xR  and  si   are  the  most  serious  effect  of  the  round- 


XR 


ing.  For  a  fixed  N,  the  discontinuities  of  the  distributions  of  xR  and  s2Xr 
may  be  expected  to  increase  in  number  (in  any  finite  interval)  but  to 
decrease  in  seriousness  as  the  width  of  the  rounding  interval  w  decreases, 
and,  for  a  fixed  w,  to  increase  in  number  but  decrease  in  seriousness  as 
N  increases;  hence,  it  is  clear  that  largeness  of  N  has  meaning  only  in 
conjunction  with  smallness  of  w. 

5.2.  Exact  Distributions  of  slR  in  Samples  of  2  and  3  from  a  Normal 
Population 

5.2.1.  Illustrative   Distributions. — In   order  to  throw  light   on   the 
joint  effect  of  w  and  N  on  the  distribution  of  slR  in  small  samples  from  a 

*  Fisher,  Statistical  Methods,  Sec.  D  of  Appendix  to  Chap.  III. 
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normal  population,  exact  distributions  of  slR  in  samples  of  2  from  a  normal 
population  have  been  computed  for  w  =  a,  a/2,  and  cr/4.  These  are 
portrayed  in  Figs.   4.1,   4.2,   and  4.3,   where  they  are  compared  with 

.N  =  2,  w=dr 
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Fig.  4.1. — Distribution  of  slR  for  samples  of  2  from  a  normal  population  when  the 
width  of  the  rounding  interval  is  one  standard  deviation,  and  corresponding  continuous 
distributions  of  sj  and  s'J   =  (13/12)si. 


the  distribution  of  si,  which  is  distributed  as  xV2  for  1  degree  of  freedom. 
In  Fig.  4.1  the  distribution  of 


tf-G  +  a)-! 


(18) 


is  shown  also,  in  view  of  (1)  in  Sec.  4.4, 
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The  large  steps  in  the  distributions  of  s|B  shown  in  these  figures 
emphasize  the  discontinuous  character  of  these  distributions.  It 
follows  from  (1)  of  Sec.  4,4  that  when  TV  is  large  the  distribution  of  s'x*  may 
be  expected  to  graduate  the  distribution  of  slB  more  closely  than  will 

N  =  2,  w  =  <r/2 
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Fig.  4.2. — Distribution  of  s^ 
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tinuous  distribution  of  si. 
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sVorl 


for  samples  of  2  from  a  normal  population  when  the 
one-half  standard  deviation,   and  corresponding  con- 


the  distribution  of  s|.  From  Fig.  4. 1  it  is  clear  that  the  improvement  is 
negligible  when  N  =  2  and  w  =  a.  Furthermore,  it  is  clearly  impossible 
to  find  smooth  curves  that  will  approximate  these  step  functions  closely 
at  all  points.  In  other  words,  the  probability  of  s%  or  s'x ,  equaling  or 
exceeding  a  value  u  will  approximate  closely  the  probability  of  si 
equaling  or  exceeding  u  only  for  certain  values  of  u,  principally  those  that 
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are  near  a  point  of  intersection  of  the  distribution  curve  of  si,  or  s^, 
with  the  horizontal  portion  of  a  step  in  the  distribution  of  si  . 

Figure  4.4  compares  the  distribution  of  slR  in  samples  of  3  from  a 
normal  population  when  w  =  a  with  the  distribution  of  si,  which  is  now 
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Fig.  4.3. — Distribution  of  s£R  for  samples  of  2  from  a  normal  population  when  the 
width  of  the  rounding  interval  is  one-quarter  standard  deviation,  and  corresponding 
continuous  distribution  of  s%. 

distributed  as  x2o"2/2  for  2  degrees  of  freedom.  Comparison  of  Fig.  4.4 
with  Fig.  4.1  shows  that  increasing  N  from  2  to  3,  Avith  w  fixed,  has 
decreased  the  magnitudes  of  the  steps,  but  not  markedly. 

5.2.2.  Effect  of  the  Position  of  the  Rounding  Lattice. — Before  attempt- 
ing to  answer  the  question  of  how  large  N  needs  to  be  for  a  given  value 
of  w,  and  vice  versa,  in  order  that  the  approximation  given  in  Paragraph 
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(1)  of  Sec.  4.4  can  be  used  with  confidence,  it  will  be  well  to  digress 
briefly  on  the  effect  of  the  position  of  the  rounding  lattice  on  the  distri- 
bution of  si   when  TV  and  w  are  fixed. 

XR 

N  =  3,  w=dr 
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Fig.    4.4. — Distribution  of  s%R  for  samples  of  3  from  a  normal  population  when  the 
width  of  the  rounding  interval  is  one  standard  deviation,  and  corresponding  continuous 


distribution  of 


si. 


Tables  4.4  to  4.6  give  to  7  decimal  places  the  exact  distributions  of 
slR  for  N  =  2  and  w  =  a,  a/2,  and  0-/4,  respectively,  corresponding  to  two 
positions  of  the  rounding  lattice  that  are  a  half  period*  apart,  when  the 

*  Let  &  denote  the  value  of  x  at  the  mid-point  of   the    ith.   interval, 
£<  -  w/2  <  x  <  &  +  w/2  (i  =  0,  ±1,  ±2,'  • "  •  •  ,  ±  oo ) 
corresponding  to  a  given  location  of  the  rounding  lattice.     Writing 
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sampling  is  from  a  normal  population.     These  two  positions  have  been 
chosen  as  follows: 

Position  I:  The  mean  of  x  in  the  population  coincides  Avith  the 
boundary  between  two  of  the  rounding  intervals.  In  the  notation 
of  the  footnote,  this  means  that  if  the  mean  of  x  in  the  population 
is  chosen  as  the  origin,  so  that  p,'1:x  =  0,  then  6  =  —(2i  —  1)t, 
where  i  is  some  positive  or  negative  integer,  or  zero. 

TABLE  4.4 

Exact  Distributions  of  si    =  (xrx  —  xr2)2/2  for  the  Extreme  Positions  of  the 

Rounding  Lattice  When  the  Width  of  the  Rounding  Interval  Is  One 

Standard  Deviation  and  xi  and  x%  Are  Independent  Observations 

from  a  Normal  Population 

(N  =  2,  w  =  <rx) 


u* 

Probability  that  -^  >  u 

Position  1 

Position  II 

0 

1.0000  000 

1.0000  000 

y2 

0.7291  077 

0.7290  858 

2 

0.2987  666 

0.2987  802 

% 

0.0832  659 

0.0832  609 

8 

0.0152  891 

0.0152  903 

2Vz 

0.0018  126 

0.0018  124 

18 

0.0001  368 

0.0001  368 

^A 

0.0000  065 

0.0000  065 

32 

0.0000  002 

0.0000  002 

Over  32 

0.0000  000 

0.0000  000 

E(slR) 

1.0833  332 

1.0833  334 

V(slB) 

4 

2.3430  509 

2.3430  491 

*  The  values  of  u  shown  include  every  possible  value  up  to  32. 

Position  II:  The  mean  of  x  in  the  population  coincides  with 
the  mid-point  of  a  rounding  interval.  Thus  6  =  —  2iri,  where  i  is 
some  positive  or  negative  integer,  or  zero. 


b 


(*+£) 


6  may  be  used  to  define  the  location  of  the  rounding  lattice.  Thus  6  =  0  when  the 
mid-point  of  the  0th  interval  is  at  the  origin  (x  =0),  and  6  =  -w  when  the  lower 
boundary  of  the  0th  interval  is  at  the  origin.  Clearly,  the  rounding  is  the  same  for 
Q  —  ip  and  d  =  <p  ±  2k-jr,  where  k  is  an  integer. 
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Table  4.  7  gives  to  6  decimal  places  the  exact  distributions  of  si  for 
N  =  3  and  w  =  a,  corresponding  to  these  two  positions  of  the  rounding 
lattice,  when  the  sampling  is  from  a  normal  distribution. 


Exact  Distributions  of  si 


TABLE  4.5 

(xr1  —  xr2)2/2  for  the  Extreme  Positions  of  the 

Rounding  Lattice  When  the  Width  of  the  Rounding  Interval  Is 

One-half  Standard  Deviation  and  xi  and  z2  Are  Independent 

Observations  from  a  Normal  Population 


(iV  =  2,w=|) 


u* 

s2 
Probability  that  — -  >  u 

u* 

s2 
Probability  that  —  >  u 
a1 

Position  I 

Position  II 

Position  I 

Position  II 

0 

H 

A 
% 

2 

1.0000  000 
0.8604  037 
0.5977  898 
0.3792  359 
0.2183  109 
0.1134  753 
0.0530  515 
0.0222  398 
0.0083  396 
0.0027  919 
0.0008  331 
0.0002  213 
0.0000  523 

1.0000  000 
0.8604  037 
0.5977  898 
0.3792  360 
0.2183  110 
0.1134  754 
0.0530  515 
0.0222  398 
0.0083  396 
0.0027  919 
0.0008  331 
0.0002  213 
0.0000  523 

16% 
4% 

22  A 

32 

Over  32 

0.0000  110 
0.0000  021 
0.0000  004 
0.0000  001 
0.0000  000 

0.0000  110 
0.0000  021 
0.0000  004 
0.0000  001 
0.0000  000 

2% 

A 

4% 

1.0208  312 

1.0208  314 

8 

81A 
2A 

2.0839  751 

2.0839  755 

12H 

18 

*  The  values  of  u  shown  include  every 


value  up  to  32. 


The  probabilities  given  in  these  tables  were  evaluated  with  the  aid  of  a 
15-place  table  of  the  probabilities  for  normal  deviates,*  and  are  believed 
to  be  correct  to  within  1  unit  in  the  last  place  shown.  The  calculations 
were  carried  to  10  decimals  or  more  at  all  stages  and  were  carefully 
checked.  The  individual  probabilities  obtained,  when  rounded  to  the 
number  of  places  shown,  sum  to  unity  within  ±  1  in  the  last  place  shown. 

From  these  tables,  and  the  fact  that  Positions  I  and  II  represent 
opposite  extremes  in  their  effects  on  s%  ,  the  following  conclusions  may  be 
drawn : 

(1)  When  N  =  2,  the  probability  that  si  J  a2  equals  or  exceeds  u 
is  unaffected  to  4  decimal  places  by  the  position  of  the  rounding  lattice 
if  w  =  a;  to  7  decimal  places,  in  general,  if  w  <  a/2. 

(2)  When  N  =  3,   the  probability  that  si  /a2  equals  or  exceeds  u 


Mathematical  Tables  Project,  Tables  of  Probability  Functions. 


210 


TECHNIQUES  OF  STATISTICAL  ANALYSIS 


Chap.  4 


is  unaffected  by  the  position  of  the  rounding  lattice  to  3  decimals  if 
iv  =  a. 

(3)  The  effect  of  the  position  of  the  rounding  lattice  on  the  distribu- 
tion of  si  appears  to  increase  with  the  number  of  observations  involved, 
that  is,  with  N. 


TABLE  4.6 

Exact  Distributions  of  s|    =  (xr1  —  xr2)2/2  for  the  Extreme  Positions  of  the 

Rounding  Lattice  When  the  Width  of  the  Rounding  Interval  Is 

One-quarter  Standard  Deviation  and  x\  and  rc2  Are  Independent 

Observations  from  a  Normal  Population 


(N  =2, 

w==ai) 

• 

u* 

S2 

Probability  that  —  >  u 

u* 

s2 
Probability  that  —A>u 

Position  I 

Position  II 

Position  I 

Position  II 

0 

1.0000  000 

1.0000  000 

81A 

0.0020  043 

0.0020  043 

3^2 

0.9296  594 

0.9296  595 

3% 

0.0010  902 

0.0010  902 

Vs 

0.7911  480 

0.7911  481 

25A 

0.0005  759 

0.0005  759 

%2 

0.6589  474 

0.6589  474 

441/32 

0.0002  954 

0.0002  954 

H 

0.5366  322 

0.5366  322 

121A 

0.0001  471 

0.0001  471 

2%2 

0.4269  273 

0.4269  273 

59%2 

0.0000  711 

0.0000  711 

% 

0.3315  445 

0.3315  445 

,  18 

0.0000  333 

0  0000  333 

4%2 

0.2511  525 

0.2511  525 

Q2%2 

0.0000  151 

0.0000  151 

2 

0.1854  693 

0.1854  693 

1&% 

0.0000  066 

0.0000  066 

% 

0.1334  464 

0.1334  464 

72%2 

0.0000  028 

0.0000  028 

2Vs 

0.0935  041 

0.0935  041 

49/2 

0.0000  011 

0.0000  011 

X% 

0.0637  758 

0.0637  758 

841A2 

0.0000  004 

0.0000  004 

% 

0.0423  269 

0.0423  269 

22% 

0.0000  001 

0.0000  001 

1Q%2 

0.0273  253 
0.0171  541 

0.0273  253 
0.0171  541 

Over  22% 

0.0000  000 

0.0000  000 

±% 

E(sV 

22%2 

0.0104  691 

0.0104  691 

XR 
2 

1.0052  058 

1.0052  058 

8 

0.0062  099 
0.0035  793 

0.0062  099 
0.0035  793 

«-* 

28%2 

VXsl) 

R 

4 

2.0207  869 

2.0207  869 

*  The  values  of  u  shown  include  every  possible  value  up  to  225/8. 


The  mean  and  variance  of  s\Jg\  for  each  of  the  situations  considered 
in  Tables  4.4  to  4.7  are  given  in  these  tables.  These  values  are  brought 
together  in  Table  4.8,  where  they  are  compared  with  the  corresponding 
average  values  computed  from  the  formulas  of  Table  4.2.  When  s\R 
is  based  on  N  independent  (rounded)  observations  from  a  normal  dis- 
tribution, then  the  formula  of  Table  4.2  for  the  average  variance  of  s2Xr 
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(19) 


where  a  =  w/o-*.     Equation  (19)  gives  the  average  value  of  the  variance 
9f   si     in  a  series  of  repeated  trials  in  each  of  which  N  independent 


TABLE  4.7 


Exact  Distributions  of  s2.    =    y     {xr\  —  xr)2/2  for  the  Extreme  Positions  of 

t.-l 
the  Rounding  Lattice  When  the  Width  of  the  Rounding  Interval  Is 
One  Standard  Deviation  and  xi,  x2,  and  xz  Are  Independent 
Observations  from  a  Normal  Population 

(N  =  3,  w  =  <rx) 


u* 

Probability 

s2 

that  —  >  u 

a1 

u* 

Probability 

s2 
that  —  >  u 

Position  I 

Position  II 

Position  I 

Position  II 

0 

1.000  000 

1.000  000 

9 

0.000  401 

0.000  401 

Vs 

0.915  415 

0.915   156 

28A 

0.000  278 

0.000  278 

1 

0.541   194 

0.541  514 

31A 

0.000  098 

0.000  098 

% 

0.339  213 

0.338  898 

12 

0.000  027 

0.000  027 

% 

0.190  327 

0.190  245 

37A 

0.000  019 

0.000  019 

3 

0.071  985 

0.072  096 

13 

0.000  008 

0.000  008 

4 

0.040  097 

0.040  103 

43A 

0.000  002 

0.000  002 

m 

0.027  447 

0.027  411 

16 

0.000  001 

0.000  001 

m 

0.008  814 
0.005  122 

0  008  808 
0.005  122 

Over  16 

0.000  000 

0.000  000 

19A 

E(slJ 

7 

0.002  200 

0.002  206 

1.083  330 

1.083  336 

2% 

0.000  629 

0.000  629 

«x 

^(•y 

< 

1.170  804 

1.170  831 

*  The  values  of  u  shown  include  every  possible  value  up  to  16. 

observations  are  drawn  at  random  from  a  normal  distribution,  the  posi- 
tion of  the  rounding  lattice  being  determined  at  random  independently 
for  each  trial.  Since  the  average  value  of  the  mean  of  slR  is  unaffected 
by  the  nature  of  the  distribution  of  x,  its  value  is  given  by  the  formula 
of  Table  4.2  in  any  case.     From  Table  4.8  it  is  seen  that 

*  When  N  is  large  so  that  (N  —  1)/N  =  1,  formula  (19)  agrees  through  the  term 
in  o4  with  the  expression  given  by  R.  A.  Fisher,  "  Mathematical  Foundations,"  p.  362, 

N 

for  the  variance  of  v   (xr.  —  Hi:x)2/N  computed  from  N  independent  rounded  observa- 

i  =  l 
tions  from  a  normal  population. 
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(1)  When  N  is  2  or  3  and  w  <  a,  the  mean  values  of  s2  for  Position  I 
and  Position  II  of  the  lattice  agree  with  each  other  and  Avith  their 
theoretical  average  values,  to  5  decimal  places  or  more. 

(2)  When  N  is  2  or  3  and  w  =  a,  the  variances  of  s2.  for  Position  I 
and  Position  II  of  the  lattice  agree  with  each  other  and  with  their  theo- 
retical average  values  to  2  decimal  places  or  more;  when  w  <  a/2,  to 
3  decimal  places  or  more. 

TABLE  4.8 
Comparison  of  Actual  Values  of  the  Mean  and  Variance  of  s2x  /al  with  the 

XRI       X 

Values  Implied  by  the  Formulas  of  Table  4.2  When  the  Width  of  the  Rounding 
Interval  Is  One  Standard  Deviation  (and  Less)  and  si    Is  Based  on  Two  or 

\  XR 


Threi 

]  Independent  Observations  from  a  Normal  Distribution 

Number  of 
observations, 

Width  of 
rounding 
interval, 

Lattice 
position* 

Mean  value  of  si  lal 

XR>        X 

Variance  of  si   lal 

XRI        X 

N 

o-  units 

Actual 

By  formula 

Actual 

By  formula 

2 

1 

I 

1.0833  322 

1.0833  333 

2.3430  509 

2.3423  322 

1 

II 

1.0833  334 

1.0833  333 

2.3430  491 

2.3423  322 

X 

I 

1.0208  312 

1.0208  333 

2.0839  751 

2.0839  300 

H 

II 

1.0208  314 

1.0208  333 

2.0839  755 

2.0839  300 

H 

I 

1.0052  058 

1.0052  083 

2.0207  869 

2.0208  711 

X- 

II 

1.0052  058 

1.0052  083 

2.0207  869 

2.0208  711 

3 

l 

I 

1.083     330 

1.083     333 

1 . 170     804 

1.170  .351 

l 

II 

1.083     336 

1.083     333 

1.170     831 

1 . 170     351 

*  For  the  exact  meanings  of  Position  I  and  Position  II,  see  Section  5.2.2. 

Fisher*  has  studied  the  effect  of  the  position  of  the  rounding  interval 
on  the  mean  values  of  xR  and  s2  in  samples  from  a  normal  distribution 
using  an  analytical  approach  based  on  the  periodic  character  of  such 
effects  noted  in  the  first  footnote  of  this  section.  He  has  calculated 
that  the  bias  of  xR  due  to  the  position  of  a  rounding  lattice  with  intervals 
one  standard  deviation  wide  will  be  less  than  one-tenth  the  standard  error 
of  xR  for  values  of  N  less  than  1.379  X  1016.  For  si  the  corresponding 
value  of  N  is  1.75  X  1014;  for  the  variance  of  s2Xr,  it  is*  1.34  X  1012. 

From  (1)  and  (2)  above  and  from  Fisher's  analysis,  we  conclude  that 

(3)  The  effect  of  the  position  of  the  rounding  lattice  on  the  mean  and 
variance  of  xR  and  si  in  samples  from  a  normal  distribution  is  minute 
in  comparison  to  the  average  effects  of  the  rounding. 

(4)  In  small  samples  (from  a  normal  distribution  at  least),  so  far  as 
the  approximation  of  the  distributions  of  xR  and  s2Xr  by  continuous 
distributions  is  concerned,  the  discontinuities  of  the  distributions  of  xR 

*  Fisher,  "Mathematical  Foundations,"  pp.  318-321. 
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and  si  ,  caused  by  the  rounding,  are  far  more  serious  than  the  effects 
of  the  position  of  the  rounding  lattice. 

5.3.  Combinations  of  Sample  Sizes  and  Rounding -interval  Widths  for 
Which  the  Actual  Distribution  of  s|B  Can  Be  Approximated  Satis- 
factorily by  a  Continuous  Distribution,  When  the  Sampling  Is  From 
a  Normal  Population 

We  have  not  found  a  rigorous  answer  to  the  question  of  how  large 
the  sample  size  N  needs  to  be  for  a  given  rounding-interval  width  w  and 
how  small  w  needs  to  be  for  a  given  value  of  N  before  the  x2  approxima- 
tion given  in  Paragraph  (1)  of  Sec.  4.4  will  adequately  represent  the  dis- 
tribution of  si  for  most  practical  purposes.  Since  the  discontinuities  of 
the  distribution  of  si  are  the  principal  source  of  the  difficulties  encoun- 
tered in  attempting  to  represent  the  actual  distribution  of  slR  by  a  con- 
tinuous distribution  when  N  is  small  and  w  is  comparatively  large,  it  is 
clear  that  an  answer  depends  on  finding  the  combinations  of  N  and  w  for 
which  the  discontinuities  of  the  distribution  of  sj  over  any  given  interval 
are  sufficiently  numerous  and  the  steps  sufficiently  small  to  render  them 

negligible. 

TABLE  4.9 
Probability  That  a  Zero  Value  of  si    Will  Be  Obtained  When  si    Is  Based  on 

XR  XR 

N  Independent  Rounded  Observations  prom  a  Normal  Distribution 


Number  of 

Width  of  rounding  interval  in  a  units  (a  =  w/a) 

observations,  N 

1 

H 

H 

H 

2 
3 

4 
5 

6 

7 

0.271 
0.085 
0.028 
0.009 
0.003 
0.001+ 

0 . 1396 
0.0225 
0.0038 
0 . 0007 
0.0001+ 

0 . 0703 
0.0057 
0.0005- 

0 . 0302 
0.0014 

o.oooi- 

The  combinations  of  N  and  w  for  which  the  discontinuities  of  the 
distribution  of  si  are  negligible  when  the  sampling  is  from  a  normal 
population  could  be  determined  by  preparing,  for  wide  ranges  of  values 
of  N  and  w,  tables  of  the  actual  distribution  of  si  analogous  to  Tables 
4.4  to  4.7.  The  arithmetical  labor  of  the  calculations  involved  in  the 
preparation  of  such  tables  is  so  enormous,  however,  that  it  seems  advisable 
to  devise  some  rule  of  thumb,  which  is  intuitively  sound,  as  a  substitute 
for  rigorous  answers  until  the  latter  become  available.  Accordingly, 
we  propose  that  the  probability  of  sj  equaling  zero  for  given  values  of  N 
and  w,  when  the  sampling  is  from  a  normal  population,  be  used  as  a  basis 
for  deciding  whether  a  given  pair  of  values  of  N  and  w  is  satisfactory.     To 
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be  more  explicit,  we  propose  that  P{sXr  =  0\N,w}  <  0.001,  evaluated 
on  the  assumption  that  the  sampling  is  from  a  normal  population,  be 
adopted  as  a  tentative  definition  of  the  combinations  of  values  of  N 
and  w  for  which  the  x2  approximation  given  in  (1)  of  Sec.  4.4  provides  a 
satisfactory  approximation  to  the  distribution  of  s|  for  practical 
purposes. 

Values  of  P{sXR  =  0\N,w},  for  sampling  from  a  normal  population, 
are  given  in  Table  4.9  for  various  combinations  of  values  of  N  and  w, 
the  latter  being  expressed  in  units  of  the  population  standard  deviation. 
It  appears  from  this  table  that  satisfactory  combinations,  where  a  =  w/a, 
are 


a  <  xAoo, 

N  >  2 

(by  extrapolation) 

a  <  Ho, 

N  >  3 

a<H, 

N  >  4 

(20) 

a<y2, 

N  >  5 

a  <  H, 

N  >  6 

a  <  1, 

N  >  7 

The  fact  that  the  combinations  (20)  represent  a  somewhat  arbitrary 
definition  of  the  circumstances  under  which  (1)  and  (3)  of  Sec.  4.4  may  be 
expected  to  yield  satisfactory  results  in  practical  work  should  not  be  lost 
sight  of.  Nor  should  the  combinations  (20)  be  regarded  as  competing 
with,  or  contradicting,  the  general  recommendation,  cited  in  the  first 
paragraph  of  Sec.  4.1,  that  w  be  taken  less  than  a/3,  or  better,  less  than 
cr/4.  The  object  of  this  general  recommendation  is  to  limit  the  inevitable 
loss  of  precision  of  estimates  of  parameters  caused  by  rounding,  and 
relates  to  the  case  where  the  choice  of  a  rounding  interval  is  yet  to  be 
made.  Combinations  (20),  on  the  other  hand,  are  not  concerned  with 
the  loss  of  precision  of  estimates  as  a  consequence  of  coarse  rounding 
but  are  proposed  as  a  rough  indication  of  the  circumstances  under  which 
data  can  be  validly  analyzed  by  simple  modifications  of  standard 
procedures. 

5.4.  Distribution  of  Students  t  and  the  Variance  Ratio  When  Based  on 
a  Small  Number  of  Rounded  Observations 

Like  the  distributions  of  all  functions  of  rounded  observations,  the 
distribution  of  tXR,  defined  in  equation  (16)  of  Sec.  4.5,  is  discontinuous. 
However,  since  the  discontinuities  of  the  distribution  of  xR  and  the 
distribution  of  sx  jointly  affect  the  distribution  of  tx  ,  its  discontinuities 
are  more  numerous  in  any  given  interval  than  those  of  the  distributions 
of  xR  and  sx  ,  and,  generally,  the  steps  are  smaller  than  those  of  the  dis- 
tribution of  sXr.  On  the  other  hand,  whenever  sXr  equals  zero,  this 
causes  tx  to  be  either  -f  oo  or  -  °o ;  according  to  the  sign  of  the  numerator 
with  the  exception  of  those  cases  in  which  xR  equals  the  population  mean 
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//1:x,  when  tx  can  be  denned  to  be  equal  to  +1  or  —  1,  the  sign  being 
determined  in  a  given  instance  by  a  random  mechanism  giving  each  sign 
equal  probability.*  The  last-mentioned  situation  cannot  occur  unless 
fi[.x  is  a  multiple  of  1/N  and  Avill  occur  very  rarely  even  when  it  is  a 
possibility.  On  the  other  hand,  it  can  be  seen  from  Table  4.9  that  infinite 
values  of  tx  may  occur  with  annoying  frequency  in  very  small  samples, 
for  example  N  <  4,  when  coarse  rounding  is  involved.  If,  however, 
the  rounding  employed  with  a  given  N  is  in  accord  with  the  recommenda- 
tions (20),  then  the  probability  of  an  infinite  value  of  tx  will  be  less  than 
0.001.  Furthermore,  the  rounding  so  chosen  will  simultaneously  render 
the  steps  in  the  distribution  of  tx  negligible,  so  that  for  practical  purposes 
tx  may  be  regarded  as  following  the  standard  t  distribution,  which  is 
strictly  valid  only  for  tx. 

Similar  considerations  apply  to  the  distribution  of  the  ratio  FR  of  two 
mean  squares,  s\x  and  s\x ',  which  are  such  that  in  the  absence  of  the 
rounding  n\s\  and  n2s\  would  be  distributed  independently  as  x2°"2  f°r  ni 
and  n2  degrees  of  freedom,  respectively.  In  particular,  FR  =  s2x  /s\x 
will  equal  0  whenever  s\x  is  0;  it  will  equal  infinity  whenever  s\x  is  0; 
it  may  be  defined  to  be  1  when  both  s\Xr  and  s\Xr  are  0.  Four  exact 
distributions  of  FR  were  calculated  from  the  distributions  of  sx  given  in 
Tables  4.4  to  4.7.  The  cases  considered  were  ri\  =  n2  =  1,  w  =  a,  a/2, 
<r/4;  rii  =  n2  =  2,  w  =  a.  As  expected,  the  discontinuities  of  FR  were 
more  numerous  in  any  given  interval  than  the  discontinuities  of  the  dis- 
tribution of  si  alone,  and  the  steps  were  smaller.  The  biggest  steps 
were  those  corresponding  to  FB  =  0,  FR  =  1,  and  FR  =  +oo.  From 
the  graphs  that  were  sketched  it  was  apparent  that  if  these  three  steps 
were  reduced  to  negligible  size,  ekher  by  increasing  ni  and  n2  or  by 
decreasing  w,  then  the  distribution  of  FR  would  be  represented  by  the  F 
distribution  to  an  approximation  sufficient  for  most  practical  purposes. 
It  is  believed  that  if  the  JVi  (  =  fti  +  1),  N2  (=  n2  +  1),  and  the  common 
rounding  interval  satisfy  the  recommendations  (20)  above,  then  FR  may 
be  regarded  as  distributed  according  to  the  F  distribution  for  most  prac- 
tical purposes. 

If  s\x  and  s\x  correspond  to  different  roundings  Wi  and  w2,  then  the 
distribution  of  FR  will  be  biased  relative  to  the  distribution  of  F.  This 
bias  cannot  be  eliminated  when  a2  is  unknown,  the  situation  in  which  F 
tests  are  customarily  employed. 

6.  STATISTICAL  PROPERTIES  OF  TABULAR  AND  INTERPOLATED  VALUES 

It  is  often  stated,  or  implied,  in  statistical  texts  that  the  error  of  a 

tabular  entry  may  be  regarded  as  a  random  variable  distributed  with  a 

*  The  complication  of  this  definition  arises  because  F  =  %  =  1  seems  a  sensible 
definition,  and  t  =  %  =0  also  seems  sensible,  but  t2  =  F.  The  complication  avoids 
the  implication  that  02  =  1. 
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uniform  probability  density  between  minus  %  unit  and  plus  Y^  unit  in  the 
last  place.  Clearly  the  rounding  error  of  a  given  tabular  entry  has  a 
definite  value,  although  it  may  be  unknown  to  a  user  of  the  table.  Hence 
the  error  of  a  given  tabular  entry  is  not  a  random  variable  in  any  meaning- 
ful sense  of  the  expression,  its  distribution  being  concentrated  at  a  point. 
Therefore,  the  statement  appearing  in  statistical  texts  must  be  intended 
to  mean  that  when  one  is  carrying  out  computations  involving  the  use 
of  a  large  number  of  different  tabular  numbers,  the  errors  of  the  indi- 
vidual tabular  numbers  considered  collectively  will,  under  favorable 
circumstances,  range  from  minus  3^2  unit  to  plus  3^  unit  in  the  last  place 
with  approximately  uniform  density. 

The  circumstances  may  sometimes  be  quite  unfavorable.  Thus  in  a 
given  computation  involving  the  use  of  numbers  from  only  a  small 
section  of  a  table — and  in  certain  other  less  easily  specified  cases  also — 
the  errors  may  all  tend  to  have  the  same  sign  or  to  be  bunched  in  some 
other  manner  characteristic  of  the  values  of  the  function  when  expressed 
to  the  given  number  of  decimal  places  for  that  particular  range  of  the 
argument.  Indeed,  the  favorability  of  the  circumstances  may  depend 
on  the  units  in  which  the  function  is  tabulated,  as  the  following  parallel 
portions  of  two  logarithm  tables  illustrate: 


N 

logio 

N 

logeN 

9861 

3.9939 

210 

9.1963  429 

9862 

3.9939 

650 

9.1964  443 

9863 

3 . 9940 

090 

9.1965  457 

9864 

3 . 9940 

531 

9.1966  470 

9865 

3 . 9940 

971 

9.1967  484 

9866 

3.9941 

411 

9.1968  498 

9867 

3.9941 

851 

9.1969  511 

9868 

3 . 9942 

291 

9.1970  525 

9869 

3 . 9942 

731 

9.1971  538 

9870 

3.9943 

172 

9.1972  551 

9871 

3.9943 

612 

9.1973  564 

9872 

3 . 9944 

051 

9.1974  577 

9873 

3 . 9944 

491 

9.1975  590 

9874 

3 . 9944 

931 

9.1976  603 

9875 

3.9945 

371 

9.1977  616 

9876 

3 . 9945 

811 

9.1978  629 

9877 

3.9946 

251 

9.1979  641 

9878 

3 . 9946 

690 

9.1980  653 

9879 

3.9947 

130 

9.1981  666 

These  logarithms  are  given  above  correct  to  7  decimal  places.     From  an 
examination  of  the  digits  in  the  seventh  place  in  the  case  of  the  values  of 
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logio  N,  it  is  apparent  that  if  one  is  calculating  with  the  aid  of  logarithms 
read  from  a  6-place  table  of  logio  N,  and  the  values  of  N  involved  are 
limited  to  the  range  shown  above,  the  errors  of  the  logarithms  will  be 
bunched  around  a  value  of  one-tenth  unit  in  the  sixth  place  and  quite 
definitely  will  not  be  distributed  more  or  less  uniformly  from  minus 
%  unit  to  plus  }/2  unit  in  the  sixth  place.  On  the  other  hand,  if  the 
computations  are  carried  out  with  the  aid  of  a  6-place  table  of  loge  N,  then 
even  though  the  values  of  N  involved  are  confined  to  the  range  shown 
above,  the  errors  of  the  logarithms  may  be  fairly  uniformly  distributed 
between  minus  Jl>  unit  and  plus  J^  unit  in  the  sixth  place,  although 
there  will  be  a  gap  in  the  distribution  of  the  error  in  the  vicinity  of 
two-tenths  unit  in  the  last  place. 

Since  there  is  a  definite  relationship  between  the  exact  values  of  a  func- 
tion for  successive  values  of  the  argument,  there  is  likewise  a  definite 
relationship  between  the  errors  due  to  rounding  in  the  tabular  values 
given  correct  to  n  decimal  places.  Hence  the  errors  due  to  rounding  to 
n  places  are  never  actually  equivalent  to  independent  observed  values  of  a 
random  variable  uniformly  distributed  between  minus  J^  and  plus  }4 
unit  in  the  last  place  retained.  Nevertheless,  if  a  large  number  of  entries, 
corresponding  to  widely  dispersed  values  of  the  argument,  are  examined, 
the  figures  beyond  the  nth  place  will  often  form  decimal  fractions  that 
appear  to  be  more  or  less  uniformly  and  independently  distributed 
between  0  and  1  in  units  of  the  last  figure  retained;  in  some  cases,  this  is 
true  even  for  relatively  small  values  of  n.  Furthermore,  when  the 
function  tabulated  is  irrational  or  transcendental,  the  larger  the  value 
of  n,  the  more  nearly  do  the  decimals  beyond  the  nth  place  form  decimal 
fractions  that  seem  to  be  uniformly  and  independently  distributed,  for  a 
given  spacing  of  the  argument.  There  are  exceptions,  of  course.  For 
example,  the  functions  10m/9,  10m/99,  10m/999,  etc.  for  m  =  0,  ±1,  ±2, 
.  .  .  ,  which  have  only  0's  and  l's  in  their  decimal  expansions,  do  not 
exhibit  either  of  the  aforementioned  properties.  Similarly,  tables  of 
good  functions  may  have  some  bad  patches.  However,  the  fact  that 
the  above  properties  are  quite  common,  especially  in  the  case  of  decimal 
expansions  of  irrational  and  transcendental  numbers,  suggests  that  in 
many  cases  it  may  be  useful  to  regard  the  errors  in  tabular  numbers  due 
to  rounding  as  uniformly  and  independently  distributed  between  plus  J^ 
and  minus  J^  unit  in  the  last  place  tabulated — provided  one  exercises 
reasonable  judgment  in  applying  the  deductions  from  this  assumption 
in  individual  cases. 

Let  us  assume  then,  with  the  foregoing  reservations,  that  when  the 
error  of  a  tabular  number  due  to  rounding  lies  between  +  w/2,  the  error 
€  is  distributed  with  uniform  probability  density  between  +w/2.  It 
follows  immediately  that 
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+- 
1      f    2 
Mean  error  =  -    /         e  de  =  0  (21) 

W  J  _w 
2 

+- 
1       /"      2  W2 

a\  =  variance  of  errors  =  —   /         e  de  =  — ^  =  0.0833w2         (22) 

w  J  _w  1^ 

2 

whence  the  standard  deviation  of  the  errors  is 

cre  =— %=  =  0.2887w  (23) 

712 


Consequently,  if  rounding  errors  are  viewed  from  this  statistical  view- 
point, equation  (21)  implies  that  rounding  is  unbiased,  that  is,  the  average 
difference  between  the  rounded  tabular  entry  and  the  true  value  of  the 
function  will  be  zero.  Maintaining  the  statistical  viewpoint,  it  follows 
that  the  sum  of  the  errors  of  k  entries  that  are  " correct"  to  within 
±w/2,  assuming  independence  of  the  errors,  will  be  distributed  about 
a  mean  of  zero  with  a  standard  deviation  of  w  -\/k/12;  furthermore, 
the  distribution  is  approximately  normal  (Gaussian)  for  k  >  4.* 

As  an  example  of  the  use  of  this  result,  consider  a  tabulation,  sup- 
posedly correct  to  5  decimals,  of  the  nine  terms  in  the  expansion  of 
(p  +  <?)8  f°r  V  —  t/6.  The  sum  of  the  nine  values  is  1.00003.  Does 
this  discrepancy  of  0.00003  from  the  true  sum  suggest  that  a  mistake 
has  been  made  in  the  calculations?  The  maximum  admissible  dis- 
crepancy is  bounded  by  ±0.000045,  so  there  is  no  positive  proof  that  the 
calculations  are  in  error.  The  observed  discrepancy,  measured  in  units 
of  0.00001,  corresponds  approximately  to  a  normal  deviate  of 

^3  =  3.46. 


Since  the  probability  of  a  normal  deviate  differing  from  0  by  more  than 
2.576  is  less  than  0.01,  f  it  is  advisable  to  check  the  calculations  (prob- 
ably starting  with  the  sum). 

Fisher  and  Wishartf  have  used  the  statistical  viewpoint  to  determine 
the  distribution  of  the  error  of  an  interpolated  value  when  the  values 
of  the  argument  are  equally  spaced.  Let  the  spacing  of  the  argument  be 
taken  as  unity,  and  let  0  to  1  represent  the  interval  containing  the  value 

*  For  details,  see  the  papers  by  Baten,  Hall,  Irwin,  and  Rietz  listed  in  Sec.  7. 

f  This  may  be  verified  from  Table  1.1  of  Chap.  1,  since  the  probability  is  2(0.005) 
that  a  normal  deviate  will  be  either  above  -K"0.oo5  or  below  —  K0.oo&- 

$  Fisher  and  Wishart,  "On  the  Distribution  of  the  Error  of  an  Interpolated 
Value." 
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of  the  argument  p  for  which  it  is  desired  to  obtain  an  interpolated  value ; 
then,  if  the  interpolation  is  performed  without  recourse  to  differences, 
using  Lagrangian  interpolation  coefficients,*  the  error  of  the  interpolate 
can  be  expressed  simply  in  terms  of  the  tabular  entries  used  to  obtain  it. 
It  is,  in  fact,  the  " interpolate"  of  the  errors: 

Error  of  linear  (2-point)  interpolate  =  (1  —  p)e0  +  pei 

Error  of  quadratic  (3-point)  interpolate 


=  p(p  ~  l)6-i  +  2(1  -  p2)e0  +  p(p  +  1)6! 
2 

TO 

Error  of  (2m) -point  interpolate  =        >        Alm(p)  -  ek 

k=-m+l  — 

m 

Error  of  (2m  +  1) -point  interpolate         =      \     Alm+1(p)  •  ek 


(24) 


If  the  errors  of  the  tabular  entries  may  be  regarded  as  independent 
random  variables  with  a  common  variance  a2,  then  the  variance  of  an 
r-point  interpolate  for  argument  p  is  simply 

V(p,r)  -  ^  Ml&W  '  »"  (25) 

k 

where  the  summation  is  over  k  from  —  m  +  1  to  m  when  r  =  2m,  and 
over  k  from  —  m  to  +m  when  r  =  2m  +  1.     In  particular  f 

V(p,2)  =  variance  of  2-point  Lagrangian  interpolate  (26) 

=  [(1  -  p)2  +  pV  =  [1  -  2p(l  -  p)W      0  <  p  <  1 
and 

V(p,S)  =  variance  of  3-point  Lagrangian  interpolate  (27) 

=  [p\p  -  l)2  +  4(1  -  p2)2  +  pKp  +  1)  V 

4 
=  [6p2(p2  -  1)  +  4k2         Q<v<1 
4  —  v  — 

It  can  be  determined  readily  by  differentiation  that  the  minimum 
value  of  V(p,S)  corresponds  to  p  —  ±  l/\/2  and  equals 

^  =  0.6250(V2 
o 

*  For  numerical  values  of  these  coefficients  for  3-point  up  to  11-point  interpolation, 
see  Mathematical  Tables  Project,  Tables  of  Lagrangian  Interpolation  Coefficients; 
formulas  for  these  coefficients  are  given  in  the  introduction  to  the  tables. 

f  Expression  (26)  is  identical  to  that  given  by  Fisher  and  Wishart,  p.  914.  In 
place  of  (27)  they  give  an  equivalent  expression  in  terms  of  a  variable  p  that  varies 
from  0  to  1  when  the  p  of  (27)  varies  from  —  >£  to  +}£. 
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which  exceeds  the  minimum  value  of  V(p,2)f  namely  <r2/2  =  0.50000c-2, 
which  corresponds  to  p  =  }i.  Also,  for  0  <  p  <  1,  it  is  clear  that  the 
maxima  are  7(0,2)  =  7(0,3)  =  7(1,2)  =  7(1,3)  =  a2.  Table  4.10 
shows  the  minima  and  the  average  values  of  V(p,r),  0  <  p  <  1,  in  the 
case  of  (2ra)-point  interpolation  (m  =  1,  2,  3,  4),  and  the  practical 
minima*  and  the  average  values  for  —  K  <  P  <  M  in  the  case  of 
(2m  +  l)-point  interpolation  (m  =  0,  1,  2,  3). 

TABLE  4.10 
Minimum  and  Average  Values  of  the  Variance  of  an  t-point  Interpolate  for 
an  Argument  between  0  and  1  When  r  Is  Even,  and  between  —  }£  and  +  ;Hj  When 
r  Is  Odd,  the  Values  of  the  Argument  Being  Spaced  at  Unit  Intervals  and  the 
Errors  of  the  Tabular  Entries  Being  Independently  Distributed  with  a 

Common  Variance  <t2 


Interpolate 

Minimum  Variance* 

(units  of  a2) 

Average  Variance  f 
(units  of  <r2) 

1-point  (an  entry) 

1.00000 

1.00000 

2-point  (linear) 

0 . 50000 

0.66666 

3-point 

0.71875 

0 . 89375 

4-point 

0.64062 

0.77566 

5-point 

0.74060 

0.89574 

6-point 

0 . 70599 

0.82244 

7-point 

0.76579 

0.90291 

8-point 

0.74533 
1.00000 

0.84942 

infinite-point 

1.00000 

*  In  the  case  of  (2m) -point  interpolation,  the  values  shown  are  actual  minima,  occurring  at  p  =  J£; 
and  in  the  case  of  (2m  +  l)-point  interpolation,  they  are  practical  minima,  occurring  at  p  =  ±K- 
For  explanation,  see  text. 

t  From  Fisher  and  Wishart,  pp.  914-915.  The  average  has  been  taken  over  the  interval  0  <  p  <  1 
in  the  case  of  (2m)-point  interpolates;  over  the  interval  —}4  <  V  <  +  M  in  the  case  of  (2m  +  l)-point 
interpolates. 


In  summary,  if  the  errors  of  the  tabular  entries  may  be  regarded  as 
independent  random  variables  with  a  common  variance  a2,  if  the  values 
of  the  argument  are  uniformly  spaced  at  unit  intervals,  and  if  r-point 
interpolation  for  the  interpolate  corresponding  to  argument  p,  with 
0  <  p  <  1  for  r  even  and  —  J^  <  V  <  +Mf°rr  °dd,  is  carried  on  with- 
out differences,  then 

(1)  The  variance  of  the  random  error  of  the  interpolate  is  always 
less  than  the  variance  of  a  single  tabular  entry. 

*  In  the  case  of  (2m  +  l)-point  interpolation  for  Y2  <  p  <  1,  a  more  accurate 
interpolate  can  be  obtained  by  taking  p  =  1  as  origin,  dropping  the  tabular  entry 
for  k  =  —m,  adding  the  entry  for  k  =  m  +  1,  and  using  1  —  p  as  argument.  With 
this  convention,  V(p,  2m  +  1)  will  have  a  practical  minimum  at  p  =  ±%,  although 
actual  minima  occur  at  p  =   ±p'  with  \p'\  >  %. 
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(2)  The  variance  of  the  random  error  of  the  interpolate  is  least  for 
linear  (2-point)  interpolation. 

(3)  The  variance  of  the  random  error  of  a  (2m  +  2) -point  inter- 
polate is  greater  than  the  variance  of  the  random  error  of  a  (2m)-point 
interpolate,  but  less  than  the  variance  of  the  random  error  of  a  (2m  +  1)- 
point  interpolate. 

(4)  The  variance  of  the  random  error  of  a  (2m  +  3)-point  inter- 
polate is  greater  than  the  variance  of  the  random  error  of  a  (2m  +  1)- 
point  interpolate. 

From  (3)  and  (4)  it  is  clear  that,  if  (2m)-point  interpolation  does  not 
yield  the  accuracy  desired,  but  either  (2m  +  1) -point  or  (2m  +  2)- 
point  will,*  then  it  may  be  advisable  to  employ  (2m  +  2)-point  rather 
than  (2m  +  1) -point  interpolation  in  order  to  minimize  the  loss  of 
precision. 

It  should  be  noted  that  these  results  require  merely  that  the  errors 
of  the  successive  tabular  entries  be  regarded  as  independently  distributed 
with  a  common  variance — it  is  not  assumed  that  the  errors  are  due  solely 
to  rounding  and  are  distributed  with  uniform  probability  density  between 
—  w/2  and  -\-w/2.  The  results  are,  of  course,  applicable  to  this  case, 
with  a-2  =  w2/12,  as  given  by  (22),  when  the  independence  assumption  is 
tenable.  Fisher  and  Wishart  point  out,  however,  that  in  this  latter 
case  the  distribution  of  the  error  of  the  interpolate  is  more  or  less  intrac- 
table, and  does  not  tend  to  the  normal  (Gaussian)  distribution  (/32  tends 
to  2.2  and  not  to  3)  as  r  increases.  This  somewhat  limits  the  usefulness 
of  the  standard  deviation  as  a  measure  of  the  random  error  of  the  interpo- 
late, although  the  inequalities  of  Tchebycheff  and  of  Camp  and  Meidell 
are  still  applicable,  f  Since,  by  (24),  the  error  of  an  interpolate  is  a 
linear  function  of  the  errors  of  the  tabular  entries  when  the  interpolation 
is  carried  out  without  recourse  to  differences  (that  is,  by  using  Lagrangian 
interpolation  coefficients),  the  error  of  the  interpolate  will  be  normally 
distributed  when  the  individual  errors  are  independently  and  normally 
distributed  with  a  common  variance.  Accordingly,  they  note  that  a 
table  printed  to  one  or  two  places  more  than  are  correct  is  capable  of 
yielding  a  more  accurate  and  more  precise  interpolate  than  the  same 
table  with  the  entries  cut  down,  provided  that  the  standard  deviation 
of  an  entry  is  given  and  the  errors  are  approximately  normally  and 
independently  distributed  about  a  mean  of  zero  with  a  common  variance. 

*  The  reader  is  cautioned  against  inferring  that  (r  +  l)-point,  (r  +  2)-point, 
and  so  forth,  interpolation  always  yields  greater  accuracy  than  r-point  interpolation; 
he  is  referred  to  Steffensen,  Interpolation,  pp.  34-53,  for  a  discussion  of  the  number  of 
points  to  be  used  in  a  given  instance  and  for  a  simple  example  in  which  the  accuracy 
becomes  worse  as  r  increases. 

f  See,  for  instance,  Shewhart,  Economic  Control  of  Quality,  pp.  176-177. 
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With  the  exception  of  Sec.  3,  it  is  based  on  a  memorandum  by 
M.  A.  Girshick,  U.S.  Bureau  of  the  Census 


CHAPTER  5 

ACCEPTANCE  INSPECTION  WHEN  LOT  QUALITY  IS  MEASURED 

BY  THE  RANGE 

1.  TYPE  OF  PROBLEM 

The  range,  denned  as  the  difference  between  the  largest  and  smallest 
observations  in  a  sample,  is  familiar  in  industrial  experimentation  and 
quality  control  as  a  measure  of  dispersion  in  samples.  Many  applica- 
tions of  the  range  are  to  normal  or  nearly  normal  populations,  for  which 
the  variability  is  better  characterized  by  the  standard  deviation.  The 
range  is  used  to  measure  dispersion  in  samples  because  of  the  ease  with 
which  it  can  be  determined,  and  because  in  small  samples  it  is  almost 
as  efficient  a  measure  of  variability  as  the  standard  deviation.*  The 
present  chapter  is  concerned  with  an  interesting  reversal  of  these  roles, 
where  the  variability  of  the  lot  or  process  is  characterized  in  terms  of  the 
range  itself  but  where  the  standard  deviation  is  used  as  the  sampling 
statistic  in  acceptance  inspection  to  determine  whether  the  range  meets 
specified  requirements. 

One  type  of  problem  in  which  the  range  is  the  parameter  of  primary 
interest  is  that  of  ensuring  the  virtual  simultaneity  of  a  set  of  events. 
This  set  of  events  may  be  thought  of  as  a  random  sample  of  k  units  from 
an  underlying  population  of  such  events,  and  the  requirement  of  simul- 
taneity as  a  demand  that  the  time  difference  between  the  occurrence  of  the 
first  and  last  event  of  such  a  sample  be  small.  In  practice,  of  course, 
some  samples  will  meet  this  requirement  and  others  will  not,  but  we  can 
formulate  the  requirement  statistically  as  follows:  Let  the  size  of  the 

*  One  statistic  is  more  efficient  than  another  if  it  varies  less  from  sample  to  sample 
for  a  fixed  population,  that  is,  if  its  sampling  variability  is  less.  To  get  a  basis  of 
comparison,  it  is  necessary  to  convert  values  of  one  statistic  into  implied  values  of 
the  other.  For  example,  given  the  range  of  a  sample  of  N  from  a  normal  population, 
an  estimate  of  the  standard  deviation  can  be  made  with  the  aid  of  Tippett's  table  of 
the  mean  range,  in  units  of  the  population  standard  deviation,  for  samples  of  N  from 
a  normal  population.  Such  an  estimate  will  vary  more  from  sample  to  sample  than 
if  the  standard  deviation  had  been  computed  directly  by  the  usual  formula.  Con- 
versely, given  the  sample  standard  deviation,  an  estimate  of  the  sample  range  can  be 
formed;  such  an  estimate  will  vary  less  from  sample  to  sample  than  if  the  range  had 
been  computed  directly.  For  samples  of  2,  however,  the  range  and  standard  deviation 
are  equally  variable;  for  samples  of  3  the  range  is  slightly  more  variable  than  the 
standard  deviation,  and  as  the  sample  size  increases  the  variability  of  the  range 
increases  relative  to  that  of  the  standard  deviation.     (For  full  citations,  see  Sec.  5.) 

Chap.  3,  Sec.  4.4,  pp.  160-163,  discusses  the  efficiency  of  statistics. 
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sample  be  k,  and  let  U  (i  =  1,  2,  •  •  •  ,  k)  be  the  time  of  occurrence 
of  the  ith  event  of  the  sample  measured  from  an  arbitrary  time  origin. 
Denote  by  Rk  the  difference  between  the  longest  and  the  shortest  of  these 
times  in  a  particular  sample.  Then  the  requirement  for  an  acceptable 
lot  is  that  in  not  more  than  a  fraction  y  of  samples  of  k  shall  the  range 
Ek  exceed  a  fixed  value  U.  If  the  time  can  be  measured  for  each  item 
in  the  lot,  it  may  be  possible  to  form  sets  of  k  in  such  a  way  that  the  range 
is  satisfactory  in  all  or  nearly  all  sets.  On  the  other  hand,  if  deter- 
mination of  the  time  alters  or  destroys  the  unit,  if  the  time  varies  from 
one  trial  to  another  for  a  given  unit,  or  if  measurement  of  the  times  and 
systematic  grouping  is  expensive,  there  are  advantages  to  a  procedure 
such  as  that  suggested,  based  on  random  grouping. 

A  typical  problem  of  ensuring  temporal  simultaneity  arises  in  con- 
nection with  mechanisms  for  which  there  is  a  certain  lapse  of  time 
between  activation — for  example,  closing  a  switch — and  operation.  If 
such  mechanisms  are  used  in  assemblies  of  two  or  more  activated  from  a 
single  source,  it  may  be  important  to  the  functioning  of  the  assembly  that 
these  time  lapses  be  nearly  equal,  in  other  words,  that  all  the  operations 
occur  within  a  brief  interval.  This  need  can  be  fulfilled  by  requiring 
that  each  lot  of  the  component  mechanisms  be  such  that  the  range  of 
delay  times  in  random  sets  of  k  mechanisms  be  less  than  some  predeter- 
mined constant.  It  is  not  possible,  Avithout  testing  the  entire  lot,  to 
assure  that  every  set  of  k  formed  at  random  from  a  lot  will  have  an 
acceptable  range;  but  it  is  possible  by  sampling  inspection  to  obtain  any 
required  level  of  assurance,  short  of  certainty,  of  detecting  lots  worse 
than  any  specified  quality  P,  as  measured  by  the  percentage  of  sets  of 
k  in  which  the  range  is  too  large. 

Problems  of  controlling  the  difference  between  the  largest  and  smallest 
measurements  in  a  group  of  k  items  are  not  confined  to  situations  involv- 
ing temporal  variability.  Balance  may  require  that  a  set  of  k  objects 
not  show  too  great  a  range  in  weight;  flatness  or  straightness  may 
require  that  a  set  of  k  items  not  vary  too  much  in  thickness  or  length; 
appearance  may  depend  upon  the  range  of  variation  within  sets  of  k  rather 
than  upon  some  other  type  of  variability. 

2.  METHOD  OF  SOLUTION 

It  is  clear  that  Avhen  acceptance  inspection  is  based  on  samples, 
there  is  always  a  risk  that  when  good  lots  are  offered  they  will  be  rejected, 
or  that  when  bad  lots  are  offered  they  will  be  accepted.  The  former 
risk  depends  on  the  frequency  with  which  good  lots  yield  bad  samples,  and 
the  latter  on  the  frequency  with  which  bad  lots  yield  good  samples. 
These  frequencies  can  be  made  as  low  as  we  please  by  taking  sufficiently 
large  samples  and  choosing  the  acceptance  criterion  properly,  but  they 


Sec.  2  QUALITY  MEASURED  BY  RANGE  229 

cannot  be  eliminated  except  by  complete  inspection  of  the  lot.  Follow- 
ing established  usage,  we  shall  call  the  first  kind  of  risk  the  "  producer's 
risk"  and  the  second  kind  the  " consumer's  risk." 

For  concreteness,  let  us  assume  that  a  lot  of  acceptable  quality 
is  defined  as  one  in  which  the  probability  is  at  least  p\  =  0.99  that  the 
range  of  14  items  selected  at  random  will  be  less  than  0.3,  measured  in 
whatever  units  are  appropriate.  Let  us  suppose  further  that  the  pro- 
ducer's risk  is  to  be  a  small  quantity,  say  a  =  0.01;  that  is  to  say,  if  lots 
of  quality  px  are  offered  we  wish  to  reject  them,  on  the  average,  no 
oftener  than  1  time  in  100.  Suppose  that  a  lot  is  unacceptable  if  the 
probability  is  p2  =  0.85,  or  less,  that  the  range  of  14  items  selected  at 
random  will  be  less  than  0.3;  and  that  the  consumer's  risk — the  proba- 
bility of  accepting  a  lot  of  quality  p2  or  worse  if  offered — is  to  be  at  most 
jS  =  0.01.  Finally,  let  us  assume  that  the  distribution  of  the  quality 
characteristic  is  adequately  represented  by  a  normal  distribution.  * 

For  a  normal  distribution  the  probability  that  the  range  of  k  inde- 
pendent observations  is  less  than  some  preassigned  number  U  depends 
only  on  the  standard  deviation  a.  Thus,  the  criteria  of  acceptable  and 
unacceptable  lots,  which  were  expressed  above  in  terms  of  the  proportion 
of  the  ranges  of  sets  of  14  that  are  less  than  0.3,  can  be  expressed  in 
terms  of  the  lot  standard  deviation  a.  It  follows  that  the  acceptance- 
inspection  plan  can  be  formulated  in  terms  of  the  sample  standard 
deviation,  or,  as  proves  to  be  more  convenient  in  practice,  in  terms  of  the 
sum  and  the  sum  of  squares  of  the  observations.  This  reformulation  of 
the. definition  of  an  acceptable  lot  permits  a  great  gain  in  the  efficiency 
of  the  inspection  plan;  for  in  using  the  sum  of  squared  deviations  as  a 
measure  of  acceptability  we  make  full  use  of  the  information  provided 
by  inspection  of  the  sample,  rather  than  of  just  that  portion  of  the 
information  given  by  selecting  the  extreme  individuals  in  each  group  of 
14  observations. 

Let  <ri  be  the  value  of  the  lot  standard  deviation  that  defines  good 
quality.  Then  cri  depends  on  the  group  size  k,  on  the  acceptable  value 
U  of  the  range  in  groups  of  k,  and  on  the  proportion  of  ranges  that 
can  be  allowed  to  exceed  U.  These  three  quantities  are  related  to  g\ 
through  the  cumulative  distribution  of  the  standardized  range,  which 
gives  the  probability  that  the  range  Rk  of  k  items  from  a  normal  dis- 
tribution will  exceed  any  specified  multiple  Wk  of  the  population  standard 
deviation  a.  Tables  of  the  cumulative  distribution  of  the  range  have 
been  published  by  Pearson,  f     In  Fig.  5.1,  which  is  based  on  Pearson's 

*  If  the  assumption  of  normality  is  not  justified,  the  inspection  can  be  carried  out 
on  an  attributes  basis,  by  classifying  each  group  of  14  in  the  sample  as  satisfactory 
or  not. 

t  Pearson's  table  gives  cumulative  probabilities  to  4  decimal  places  for  values  of  k 
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tables,  the  vertical  axis  shows  the  probability  that  Rk/<r  will  exceed  the 
values  of  W  shown  on  the  horizontal  axis,  and  the  various  curves  cor- 
respond with  various  group  sizes  k.  For  our  purposes,  R  is  given,  W 
is  obtained  from  Pearson's  table  or  from  Fig.  5.1,  and  a  is  the  quotient 


Rk 
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Data  for  these  curves  are  from  E.  S.  Pearson,  Biometrika,  1942.- 

Fig.  5.1. — Cumulative  probability  distribution  of  the  range  in  samples  of  k  from  a  normal 

population. 

For  the  illustration  cited,  we  find  the  value  of  Wu  corresponding  with 
probabilities  of  0.99  and  0.85  on  the  curve  for  k  =  14.  These  are  5.40 
and  4.20,  respectively;  on  substituting  them  and  Rk  =  0.3  into  (1),  we  find 
K8  and  K4  as  the  values  of  <ri  and  o-2. 

Thus  the  problem  becomes  that  of  devising  an  inspection  procedure 
for  the  standard  deviation  such  that  the  operating-characteristic  curve 
passes  through  the  two  points 

(in  his  notation  n)  from  2  to  20  by  steps  of  1  and  for  values  of  W  from  0  to  7.25  by 
steps  of  0.01. 
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<n  =  0.05556        L9i  =  0.99  (2) 

o-2  =  0.07143         L9\  =  0.01  (3) 

Such  an  inspection  procedure  may  be  carried  out  by  either  single  or 
sequential  sampling.     The  solutions  are  indicated  only  briefly  here.  * 

3.  SINGLE  SAMPLING 
To  determine  the  required  sample  size,  we  computet 


® 


1.6531  (4) 


Then  in  a  table  of  x2  we  find  the  number  of  degrees  of  freedom  n  for  which 

4-  =  P  (5) 

Xi-e 

where  xl  denotes  the  value  of  x2  exceeded  with  probability  e. 

In  the  present  example,  since  a  =  (3  =  0.01,  we  may  refer  to  Table  8.2 
and  seek  1.6531  in  the  column  for  /3  =  0.01;  however,  the  table  shows 
only  that  n  >  120.     We  therefore  use  the  large  sample  approximation  J 


=  1  /if  acrx  +  gjg»\ 

2    \         0"2    —    0"l 

=  173 


)  (6) 


where  Ka  and  K p  represent  normal  deviates  exceeded  with  probabilities 
a  and  jS,  respectively,  which  may  be  found  from  Table  1.1  in  Chap.  1. 
Thus  173  is  the  number  of  degrees  of  freedom  required,  and  since  the 
mean  is  unknown  the  required  sample  size  is 

N  =  n  +  1  (7) 

=  174 

From  the  sample  of  174,  we  calculate 

S=l^~^f  (8) 

;  =  i 

where  x%  (i  =  1,  2,  •  •  •  ,  N)  represents  the  value  of  the  ith.  measurement 
in  the  sample.     The  lot  is  rejected  if  S  >  S0,  where 

So  =  crlxlM  (9) 

*  For  a  discussion  of  tests  of  the  standard  deviation  see,  on  single  sampling, 
Chap.  8,  and,  on  sequential  sampling,  Chap.  17  below  and  Statistical  Research  Group, 
Sequential  Analysis,  Sec.  6. 

t  See  Chap.  8,  Sec.  2.4,  especially  Example  III,  pp.  272-274. 

%  See  Chap.  8,  Sec.  2.4,  first  footnote  on  p.  274. 
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in  which  xl(n)  represents  the  value  of  x2  for  n(=  N  —  1)  degrees  of 
freedom  exceeded  with  probability  a;  otherwise  the  lot  is  accepted.  In 
the  present  example,  n  is  outside  the  range  of  the  x2  tables,  so  we  use 
the  approximation  (due  to  R.  A.  Fisher) 

2.  (Kg     +      V2^l)2 

xi(n)  = (10) 

=  218.416 

The  OC  curve  is  easily  obtained  from  the  fact  that  the  probability 
Lff  of  accepting  a  lot  of  quality  a  Avhen  offered  is  simply  the  probability 
that  a  value  of  x2  based  on  n  degrees  of  freedom  will  be  less  than  So/a2. 
When  a  =  0.01  or  0.05  and  n  does  not  exceed  120,  Table  8.1  or  8.2 
provides  13  points  on  the  OC  curve;  values  of  <j2/(j\  are  shown  for  various 
values  of  La.  When  n  is  beyond  the  range  of  the  table,  the  OC  curve 
can  be  found  from 

°  =  sir  -  K (11) 

4.  SEQUENTIAL  SAMPLING 
A  sequential-inspection  plan  based  on  standard  deviations  is  denned 
by  two  parallel  straight  lines  given  by  the  formulas 

Zn  =  h2  +  sn  (rejection  line)         (12) 

Zn  =  —hi-\-sn  (acceptance  line)    (13) 

where  hi,  /12,  and  s  are  positive,  n  represents  one  less  than  the  number  of 
observations  taken  sequentially,  and 

n+l 


Z, 


- 1  -I 


n  +  1 


is  the  sum  of  squares  of  deviations  of  the  n  +  l  observations  from  their 
mean. 

The  sequential  procedure  is  as  follows:  Draw  the  two  parallel  lines 
given  by  equations  (12)  and  (13)  with  Zn  on  the  vertical  scale  and  n  on 
the  horizontal  scale.  Start  taking  observations,  beginning  with  two 
observations  and  thereafter  adding  one  observation  at  a  time.  After  each 
observation  compute  Zn  from  all  observations  and  plot  its  value  against 
the  corresponding  value  of  n  (which  is  one  less  than  the  total  number  of 
observations  thus  far  taken).  If  at  any  stage  the  plotted  point  falls  on 
or  above  the  upper  line,  given  by  equation  (12),  reject  the  lot;  if  it  falls 
on  or  below  the  lower  line,  given  by  equation  (13),  accept  the  lot.  If, 
however,  the  plotted  point  falls  between  the  two  parallel  lines,  take 
another  observation.  Continue  this  process  until  a  decision  of  acceptance 
or  rejection  is  reached. 
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The  two  intercepts  —  hi  and  h2  and  the  common  slope  s,  which  deter- 
mine the  two  parallel  lines,  are  given  by 

2  1oge—- 
hi  =  —j f-  (14) 

=  0.071799 

1  -  8 


and 


2l0ge 

/>2=^ f-  (15) 

erf       crj 
=  0.71799 

log. -| 

«  =  ^j  (16) 

=  0.0039268 

The  OC  curve  for  the  sequential  plan  is  computed  from  the  following 
formulas : 


I    _    e-2ts  et(h!+h2)    _    ethi 

a  =  \~^T"  Lo  =  ««*i+*«>  -  l  (17) 

It  will  be  observed  that  L^,  the  ordinate  of  the  OC  curve,  is  not  given 
directly  as  a  function  of  <r,  but  parametrically  through  a  third  variable  t 
in  terms  of  which  <r  is  also  defined.  The  OC  curve  for  this  example  is 
shown  in  Fig.  5.2. 

The  average  sample  number  (ASN)  curve  shows  the  relationship 
between  the  standard  deviation  in  the  submitted  lot  and  the  average 
number  of  items  that  will  be  inspected  before  a  decision  is  reached.  The 
ordinate  Nff  of  the  ASN  curve,  corresponding  to  the  abscissa  a,  is  given  by 

N.  =  L'{hl  +  h'\  ~  -2  +  1  (18) 

s  —  a2 

where  L„  and  a  are  given  by  equations  (17).  Formula  (18)  can  be  evalu- 
ated directly  for  every  value  of  a  except  a  =  \/s,  for  which  it  can  be 
shown  that 

#^  =  ^  +  1  (19) 

This  will  ordinarily  be  close  to  the  maximum  average  amount  of  inspection 
required  to  reach  a  decision.  The  ASN  curve  for  the  present  example 
is  included  in  Fig.  5.2. 
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Fig.  5.2. — OC  and  ASN  curves  for  sequential-sampling  plan  based  on  standard  devi- 
ation; 0-1=0.05556,  o-2=0.07143,  a  =  0.01,  yS  =  0.01;  hi  =  0.07180,  h*  =  0.07180, 
s  =  0.003927. 
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CHAPTER  G 

VARIABILITY  OF  AMOUNT  OF  INSPECTION 
FOR  DOUBLE,  MULTIPLE,  AND  SEQUENTIAL  SAMPLING 

1.  SINGLE,  DOUBLE,  MULTIPLE,  AND  SEQUENTIAL  SAMPLING 

Sampling  inspection  is  said  to  be  based  on  attributes  if  inspection 
of  each  item  results  in  its  classification  as  defective  or  nondefective,  and 
disposition  of  the  lot  or  process  from  which  the  sample  comes  depends 
upon  the  number  of  defectives  found  in  the  sample.  One  or  more  sam- 
ples are  selected  at  random  from  the  lot ;  if  the  total  number  of  defectives 
found  in  the  first  k  samples  does  not  exceed  an  acceptance  number  Ak, 
the  lot  is  accepted;  if  it  equals  or  exceeds  a  rejection  number  Rk  (Rk  >  Ak), 
the  lot  is  rejected;  if  it  falls  between  Ak  and  Rk,  another  sample  is 
inspected. 

A  single-sampling  plan  is  one  for  which 

Rl  =  Al  +  l  (1) 

A  double-sampling  plan  is  one  for  which 

#1  >  Ai  +  1 

R2  =  A2  +  l  (2) 

A  multiple-sampling  plan  is  one  for  which 

Rk  >  Ak  +  1  for  k  <  2  (3) 

Rk  =  Ak  +  1  for  some  k  >  2 

In  a  sequential-sampling  plan,  the  successive  samples  are  all  of  size  1  and 

Rk-  Ak  =  C  +  5k  (4) 

where  C  is  some  constant  and  each  8k  is  0  or  1.* 

*  For  an  exposition  of  single,  double,  and  multiple  sampling,  see  Statistical 
Research  Group,  Sampling  Inspection;  what  is  here  called  "multiple  sampling"  is 
there  called  " sequential  sampling,"  and  what  is  here  called  "sequential  sampling"  is 
there  called  "item-by-item  sequential  sampling."  For  an  exposition  of  sequential 
sampling  see  Chap.  17  of  this  volume,  or  Statistical  Research  Group,  Sequential 
Analysis.     (For  full  citations,  see  Sec.  5.) 

Equation  (4)  is  a  necessary  condition  for  a  sequential-sampling  plan,  but  it  is  not 
quite  a  sufficient  condition  unless  further  conditions  are  imposed  on  the  5/;. 
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2.  VARIABILITY  OF  AMOUNT  OF  INSPECTION 

One  of  several  factors  determining  the  relative  merits  of  single, 
double,  multiple,  and  sequential  sampling  is  the  variability  of  the  amount 
of  inspection  for  plans  giving  equivalent  operating-characteristic  (OC) 
curves.  Variability  of  the  amount  of  inspection  causes  fluctuations 
in  the  inspection  load  and  thus  may  create  administrative  problems  in 
an  inspection  program. 

Variability  may  be  interpreted  in  two  ways.  First,  it  may  mean 
variation  in  the  average  number  of  items  inspected  per  lot  as  the  quality 
of  submitted  lots  varies;  in  other  words,  the  amplitude  of  the  rise  and 
fall  of  the  average  sample  number  (ASN)  curve.  Second,  it  may  mean 
variation  in  the  number  of  items  inspected  for  lots  of  a  given  quality. 

Clearly,  single  sampling  will  show  less  variability,  in  either  sense, 
than  double,  multiple,  or  sequential  sampling.  In  fact,  with  single 
sampling  there  will  be  no  variability  of  either  kind  unless  inspection  is 
curtailed,  that  is,  terminated  as  soon  as  the  rejection  number  is  exceeded 
or  the  number  of  items  remaining  uninspected  is  insufficient  to  cause 
rejection  even  if  all  are  defective. 

No  general  relations  among  the  variabilities  of  double,  multiple, 
and  sequential  plans  are  known.  The  purpose  of  this  chapter  is  to 
present  two  illustrative  comparisons.  These  two  comparisons  suggest 
by  their  differences  that  no  simple  conclusions  about  the  relative  varia- 
bility of  the  three  types  of  sampling  are  valid.  In  the  illustrations  it  is 
assumed  that  the  first  sample  of  the  double  and  multiple  plans  will  be 
inspected  completely  (this  being  usual*  to  facilitate  estimation  of  the 
process  average),  that  inspection  of  later  samples  will  be  curtailed  as 
soon  as  the  rejection  number  is  reached,  but  that  curtailing  will  not  be 
applied  to  accepted  lots. 

3.  ILLUSTRATIONS 
Example  I.  Table  6.1  presents  a  set  of  double,  multiple,  and 
sequential  plans  whose  OC  curves  are  nearly  the  same.f  Figure  6.1 
shows  their  average  sample  numbers  N  as  functions  of  lot  quality, 
and  Fig.  6.2  shows  their  standard  deviations  o~(N)  of  amount  of 
inspection  for  lots  of  a  given  quality  as  functions  of  lot  quality. 

Example  II.  Table  6.2  presents  a  second  set  of  plans  with  match- 
ing OC  curves  ;t  Fig.  6.3  shows  N  as  a  function  of  lot  quality,  and 
Fig.  6.4  shows  o-(N)  as  a  function  of  lot  quality. 

*  It  is  not,  however,  necessary.  See  Girshick,  Mosteller,  and  Savage, 
"Unbiased  Estimates  for  Certain  Binomial  Sampling  Problems  with  Applications.'' 

t  The  OC  curves  of  the  double  and  multiple  plans  are  shown  in  Table  4  of  Sampling 
Inspection  for  sample  size  letter  H,  acceptable  quality  level  (AQL)  class  0.32-0.65 
percent  defective  (p.  288). 

t  The  OC  curves  of  the  double  and  multiple  plans  are  shown  in  Table  4  of  Sampling 
Inspection  for  sample  size  letter  K,  AQL  class  3.2-4.4  percent  defective  (p.  330). 
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TABLE  6.1 
Double,  Multiple,  and  Sequential  Plans  of  Example  I 


DOUBLE    SAMPLING 


Sample 
size 

Combined  samples 

Sample 

Size 

Acceptance 
number 

Rejection 
number 

First 

50 
100 

50 
150 

0 
2 

3 

Second 

3 

MULTIPLE    SAMPLING 


Sample 
size 

Combined  samples 

Sample 

Size 

Acceptance 
number 

Rejection 
number 

First 

Second .  % 

20 
20 
'  20 
20 
20 
20 
20 

20 

40 

60 

80 

100 

120 

140 

* 
* 

0 

1 
1 

1 
2 

2 
2 

Third 

2 

Fourth 

3 

Fifth 

Sixth 

3 
3 

Seventh 

3 

*  Acceptance  not  permitted  until  three  samples  have  been  inspected. 


SEQUENTIAL    SAMPLING 

<x  =  0.05         0  =  0.10         pi  =  0.005         p2  =  0.05 
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Fig.  6.1. — ASN  curves  for  double,  multiple,  and  sequential  plans  of  Table  6.1. 
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TABLE  6.2 
Double,  Multiple,  and  Sequential  Plans  of  Example  II 

DOUBLE    SAMPLING 


Sample 

Sample 
size 

Combined  samples 

Size 

Acceptance 
number 

Rejection 
number 

First 

Second 

150 
300 

150 
450 

9 
23 

24 
24 

] 

MULTIPLE    SAMPLING 

Sample 
size 

Combined  samples 

Sample 

Size 

Acceptance 
number 

Rejection 
number 

First 

Second       

50 
50 
50 
50 
50 
50 
50 
50 

50 
100 
150 
200 
250 
300 
350 
400 

1 
3 

7 
10 
13 
16 
19 
24 

6 
9 

Third 

13 

Fourth 

Fifth 

Sixth 

Seventh 

Eighth 

16 
19 
22 
25 

25 

SEQUENTIAL    SAMPLING 

a  =  0.05         13  =  0.10         pi  =  0.0410         p2  =  0.0920 

4.  COMPUTATIONAL  METHODS 

For  double  and  multiple  sampling,  N  and  a(N)  may  be  calculated 

by  an  extension  of  methods  described  in  Sampling  Inspection,  Chap.  17. 

For  sequential  sampling,  the  computations  for  N  =  E(N)  are  based  on 

the  formula  derived  in  Sequential  Analysis,  Sec.  5.0921,  and  those  for 

a(N)  on  the  relation 

<r2(iV)  =  E(N2)  -  [E(N)]2  (5) 

where  the  formula  for  E(N2)  may  be  obtained  by  an  extension  of  the 
method  used  in  Sec.  £.0921  of  Sequential  Analysis  for  obtaining  E(N). 
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CHAPTER  7 

PLANNING  AND  ANALYZING  EXPERIMENTS  FOR  COMPARING 
TWO  PERCENTAGES 


1.  TYPE  OF  PROBLEM 


The  type  of  problem  dealt  with  in  this  chapter  may  be  indicated 
by  a  simple  example.  Suppose  that  trials  of  two  fire-control  devices 
have  resulted  in  the  following  data: 


TABLE  7.1 

Hits 

Misses 

Total 

Old                 

3 
4 

197 
196 

200 

New    

200 

Total            

7 

393 

400 

The  question  is  whether  the  new  method,  which  shows  33  percent  more 
hits  than  the  old,  can  be  expected  to  yield  more  hits  in  the  long  run;  or 
whether  its  superiority  in  this  experiment  ought  to  be  regarded  as  mere 
chance,  in  the  sense  that  two  series  of  trials  with  the  same  method  would 
often  differ  as  much. 

We  may  look  at  the  question  this  way:  From  400  rounds,  7  hits 
were  secured.  If  the  two  methods  are  really  equally  good  in  the  long 
run,  these  7  hits  should  be  divided  between  the  two  series  in  proportion 
to  the  total  numbers  in  the  series,  except  for  chance  variations.  In 
the  present  case  we  should  expect  the  7  hits  to  be  divided  equally  between 
the  two  series;  and  similarly  the  393  misses.  As  it  happens,  the  division 
in  Table  7.1  is  as  nearly  equal  as  it  possibly  could  be,  since  the  entries 
are  necessarily  integers.  It  is  therefore  immediately  obvious  in  this 
particular  instance  that  the  probability  is  one-half  that,  if  the  two 
methods  were  really  equally  good  in  the  long  run,  a  single  experiment 
would  show  for  the  new  method  as  much  (or  more)  apparent  superiority 
as  is  shown  in  Table  7.1.  (If  this  is  not  obvious,  it  can  be  verified  by 
the  method  described  below  for  use  with  cases  where  the  answer  is  not 
apparent.) 

Data  such  as  those  of  Table  7.1  represent  a  double  dichotomy. 
The  observations  are  dichotomized  according  to  one  criterion — that  is, 
they  are  divided  into  two  mutually  exclusive  and  exhaustive  classes — and 
then  each  class  is  dichotomized  again  according  to  a  second  criterion. 
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The  result  of  doubly  dichotomizing  the  data  is,  of  course,  the  same 
whichever  criterion  is  applied  first;  that  is,  we  may  regard  the  data 
as  classified  first  by  method  (old  or  new)  and  then  by  effect  (hit  or  miss) , 
or  we  may  regard  them  as  classified  first  by  effect  and  then  by  method. 
The  marginal  totals  of  Table  7.1  represent  two  independent  dichotomies 
of  the  data;  they  do  not  constitute  a  double  dichotomy.  The  problem 
may  be  put  in  either  of  two  ways:  (1)  Is  the  division  between  effects  the 
same  for  both  methods?  (2)  Is  the  division  between  methods  the  same 
for  both  effects?  These  questions  are  logically  identical,  though  for  a 
particular  problem  one  of  them  usually  seems  more  natural  than  the  other. 
The  problem  is  thus  to  test  the  homogeneity  of  a  pair  of  dichotomies. 


2.  METHOD  OF  ANALYSIS* 


2.1.  Exact  Method 


In  order  to  illustrate  the  general  procedure  for  testing  the  homo- 
geneity of  a  double  dichotomy  such  as  that  of  Table  7.1,  consider  the 
data  of  Table  7.2,  where  the  answer  is  not  immediately  apparent. 

TABLE  7.2 


Successes 

Failures 

Total 

Standard              

2 
5 

198 
195 

200 

Experimental 

200 

Total       

7 

393 

400 

For  convenience  and  generality  the  two  methods  compared  are  here 
designated  "standard"  and  "experimental"  and  the  two  possible  outcomes 
of  a  trial  with  either  method  as  "success"  and  "failure."  In  practice, 
of  course,  the  two  methods  may  be  equally  experimental  or  equally 
standard,  and  the  two  possible  outcomes  may  be  two  types  of  success  or 
two  types  of  failure  or  merely  two  different  outcomes  not  properly 
regarded  as  either  success  or  failure. 

It  might  appear,  in  view  of  the  remarks  in  Sec.  1,  that  we  could 
treat  the  2:5  division  of  successes  like  a  2:5  division  of  heads  and  tails 
on  7  tosses  of  a  coin,  using  the  binomial  distribution  to  find  the  probability 
of  as  many  as  5  heads  on  7  tosses  if  the  coin  were  really  unbiased.  Table 
7.2,-  unlike  Table  7.1,  however,  gives  different  answers  for  the  successes 
and  failures  columns  according  to  this  treatment;  so  some  other  method  of 
analysis  is  called  for.  (In  this  particular  instance,  however,  it  happens 
that  analysis  of  the  seven  successes  according  to  a  binomial  distribution 
gives  an  excellent  approximation  to  the  correct  probability,  because  both 
proportions  of  successes  are  small;  see  below,  Sec.  2.2,  p.  255.) 

We  proceed  as  follows.     Let  P  designate  the  true  long-run  probability 

*See  Fisher,  Statistical  Methods,  Sees.  21,  21.01,  and  21.02.     (For  full  citations, 
see  Sec.  6.) 
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of  a  success,  that  is,  the  probability  of  a  success  in  the  population  from 
which  the  sample  comes.  P,  of  course,  is  unknown,  for  we  should  have 
to  carry  out  an  infinitely  long  series  of  trials  to  measure  it.  But  we 
make  our  calculations  on  the  assumption  that  P  is  the  same  in  both 
series,  and,  as  we  shall  see,  it  cancels  out.  If,  on  the  assumption  that  P 
is  really  the  same  in  the  long  run  for  both  series,  we  find  that  a  difference 
as  great  as  that  shown  in  the  data  would  be  unlikely,  then  the  fact  that 
this  unlikely  difference  has  actually  been  observed  experimentally  leads 
us  to  conclude  that  the  assumption  was  wrong — to  conclude,  in  other 
words,  that  the  long-run  probabilities  are  not  equal. 

The  probability  that  200  trials  will  show  exactly  2  successes  is,  by 
the  binomial  distribution, 

200  •      P2f1     _   p)198  (I) 

21198!      U  ;  K) 

Similarly,  the  probability  of  exactly  5  successes  is 

200! 


51195! 


P5(l  -  P)195  (2) 


So  the  probability  that  the  two  series  of  200  trials  will  show  exactly  2 

successes  for  the  first  series  and  exactly  5  for  the  second  is  the  product  of 

(1)  and  (2) 

200!200! 

2!5!l95!l98r    U       n  {6) 

This  is  an  absolute  probability,  whereas  we  require  the  conditional 
probability  that  if  exactly  7  successes  and  393  failures  occur  in  the  two 
series  they  will  be  divided  between  the  series  as  shown  in  Table  7.2; 
hence  (3)  must  be  divided  by  the  probability  of  exactly  7  successes  in 
400  trials. 

400' 
71393!      >  ;  [) 

The  probability  that  if  we  get  7  successes  among  two  series  of  200  each, 
they  will  be  distributed  2  to  the  standard  method  and  5  to  the  experi- 
mental, is  the  ratio  of  (3)  to  (4),  or 

7!393!200!200!        ^ 


400!2!5!195!198! 


It  should  be  noted  that  expression  (5)  would  be  the  same  if  expressions 
(1),  (2),  and  (4)  had  been  formed  from  the  two  columns  instead  of  from 
the  two  rows  of  Table  7.2. 

In  general,  expression  (5)  is  composed  as  follows:  The  numerator  is 
the  product  of  the  factorials  of  the  four  marginal  totals;  the  denominator 
is  the  product  of  the  factorials  of  the  four  cell  entries  and  the  factorial 
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of  their  sum.     Thus,  if  we  give  Tables  7.1  and  7.2  the  general  representa- 
tion shown  in  Table  7.3, 

TABLE  7.3 


Successes 

Failures 

Total 

Standard            

a 
c 

b 
d 

a  +  b 

Experimental 

c  +  d 

Total 

a  +  c 

b  +d 

a  +  b  +  c  +  d 

expression  (5)  may  be  written 

(a  +  b)\(c  +  d)\(a  +  c)\(b  +  d)\  , 

a\b\c\d\(a  +  b  +  c  +  d)\  W 

Expression  (5)  gives  the  probability  that  if  in  fact  the  long-run 
probabilities  were  equal,  an  experiment  involving  200  trials  of  each  method 
and  resulting  in  a  total  of  7  successes  would  show  exactly  the  superi- 
ority for  the  experimental  method  that  is  shown  by  Table  7.2.  For 
the  final  probability  to  use  in  judging  whether  there  is  a  long-run  differ- 
ence, we  must  add  to  (5)  the  probabilities  of  superiorities  greater  than 
that  of  Table  7.2.  Tables  7.4  and  7.5  show  possible  results  that  would 
have  indicated  even  greater  superiority  for  the  experimental  method  than 
does  Table  7.2. 

TABLE  7.4 


Successes 

Failures 

Total 

Standard 

1 

6 

199 
194 

200 

Experimental 

200 

Total 

7 

393 

400 

Expression  (6)  gives  the  probability  for  Table  7.4  as 
7!393!200!200! 


400!1!6!194!199! 

TABLE  7.5 


0.0534 


(7) 


Successes 

Failures 

Total 

Standard 

0 

7 

200 
193 

200 

Experimental 

200 

Total 

7 

393 

400 

Expression  (6)  gives  the  probability  for  Table  7.5  as 

7!393!200!200! 


400  !0!7!  1931200! 


=  0.0074 


(8) 
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The  sum  of  the  three  probabilities  (5),  (7),  and  (8),  or  0.2244,  is 
the  probability  that  the  experiment  will  show  as  much  or  more  apparent 
superiority  of  the  experimental  method  as  Table  7.2  shows,  if  in  fact 
there  is  no  real  difference.  It  should  be  noted  that  we  add  together  the 
probabilities  of  only  those  cases  representing  greater  superiority  for  the 
experimental  method  than  was  observed  in  the  sample.  We  do  not 
include  the  probabilities  of  cases  representing  greater  superiority  for  the 
standard  than  was  observed  for  the  experimental  method.  This  is 
because  we  are  analyzing  the  data  in  order  to  decide  whether  the  experi- 
mental is  superior  to  the  standard  method,  not  simply  to  decide  whether 
there  is  a  difference  between  the  methods  without  regard  to  its  direction. 

Finally,  a  conclusion  is  reached  regarding  the  superiority  of  the 
experimental  method  as  follows:  We  agree  to  accept  the  experimental 
method  as  superior  to  the  standard  on  the  basis  of  an  analysis  of  the 
above  type  only  when  the  resulting  probability  is  less  than  a,  where  a  is 
chosen  in  advance  and  is  generally  taken  to  be  0.10,  0.05,  0.01,  or  0.001, 
depending  upon  the  seriousness  of  accepting  the  experimental  method  as 
superior  when  it  is  not.  Then  a  is  the  probability  of  accepting  the 
experimental  method  as  superior  when  actually  there  is  no  real  difference 
between  the  methods.  In  terms  of  any  of  the  usual  values  of  a,  the 
probability  0.2244  would  imply  that  the  data  of  Table  7.2  provide  no 
basis  for  concluding  that  the  experimental  method  is  actually  superior 
to  the  standard.  + 

2.2.  Short-cut  Methods 

The  evaluation  of  the  factorials  involved  in  expressions  (5),  (7),  and 
(8)  was  done  by  cancellation  of  common  factors.  This  procedure  is  the 
simplest  in  this  case,  but  in  general  it  may  be  very  laborious.  The 
calculations  can  be  considerably  reduced  by  using  a  table  of  the  logarithms 
of  factorials.  * 

A  much  simpler  procedure,  which  usually  yields  very  satisfactory 
approximations  to  the  exact  method  described  in  Sec.  2.1,  is  available 
when  at  least  two  (and  preferably  five)  successes  would  be  expected  in 
each  cell  if  there  were  no  difference  in  the  long-run  probabilities  for 
the  two  methods.  This  expected  number  of  successes  is  computed  for  any 
cell  by  dividing  the  product  of  the  marginal  totals  for  that  cell  by  the 
total  number  of  trials  made;  thus,  the  expected  number  of  successes  for 
the  standard  method  is,  on  the  basis  of  Table  7.3, 

(a  +  b)(a  +  c)  .  . 

a  +  b  +  c  +  d  {  } 

*  A  convenient  table  is  given  in  Fry,  Probability  and  Its  Engineering  Uses. 
Shorter  tables  can  be  found  in  various  handbooks. 
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In  such  cases  we  calculate  a  quantity  %<•  as  follows: 
First,  adjust  the  table  of  observations  so  that  each  frequency  is 
brought  3^2  unit  nearer  the  frequency  expected  if  there  were  no  long-run 
difference  between  the  two  methods.  This  may  be  done  by  adjusting  one 
frequency  and  then  changing  the  others  so  that  the  marginal  totals  of 
the  new  table  equal  those  of  the  old.     For  example,  in  Table  7.2  the 


expected  frequency  for  the  first  cell  is 


7  X  200 
400 


=  3i 


so  Ave  increase 


the  observed  figure  of  2  to  2J/2-     Changing  the  other  figures  to  keep 
the  marginal  totals  fixed,  we  have 


TABLE  7.6 


Successes 

Failures 

Total 

Standard .  .                                              

2V2 

4:V2 

197M 
1953^ 

200 

Experimental 

200 

Total 

7 

393 

400 

Table  7.6  represents  Table  7.2  " corrected  for  continuity."* 

Second,  each  adjusted  frequency  is  subtracted  from  the  corresponding 
expected  frequency,  the  differences  are  squared,  the  squares  are  divided 
by  the  corresponding  expected  frequencies,  and  the  ratios  are  summed  to 
obtain   xl- 

The  final  value  of  xl  may  be  ^obtained  directly  without  going 
through  the  intermediate  steps,  as  follows:  Calculate  the  products  of 
frequencies  diagonally  opposite  each  other  in  the  adjusted  table;  subtract 
one  diagonal  product  from  the  other;  square  the  difference;  multiply  the 
square  by  the  sum  of  the  four  frequencies  in  the  table;  and  divide  this 
product  by  the  product  of  all  four  marginal  totals.  From  the  data  of 
Table  7.6,  we  have 


xl  = 


(2y2  x  1951^ 


4M  X  imiY  X  400  = 


7  X  393  X  200  X  200  (10) 

More  generally,  if  we  replace  a,  b,  c,  and  d  in  Table  7.3  by  a' ',  b' ',  c',  and  d' ', 
representing  the  frequencies  corrected  for  continuity,  (10)  becomes 

2        {a/d/_  -  b'c'YJa  +  b  +  c  +  d) 
Xc        (a  +  b)(a  +  c)(c-i-d)(b  +  d)  {     } 

*  Although  this  account  of  the  continuity  correction  describes  the  procedure 
accurately,  it  may  convey  an  erroneous  idea  of  the  theoretical  basis  for  the  correction. 
The  general  procedure  is  to  carry  through  the  calculation  of  %  from  the  original  data, 
then  find  the  next  lowest  value  of  %  that  could  have  arisen,  and  finally  take  xc  as 
halfway  between  these  two  values.  In  the  particular  case  of  double  dichotomies,  this 
value  of  x  can  be  found  most  easily  by  correcting  the  original  data  as  in  Table  7.6; 
but  in  most  other  cases  where  the  %  test  is  used,  correction  of  the  original  data  will 
lead  to  serious  error  in  the  value  of  x-  For  an  excellent  analysis  and  presentation  of 
this  point,  see  Cochran,  ''The  x2  Correction  for  Continuity,"  especially  p.  423. 
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Next,   we  take  the  square  root  and  find   Xc  —  0.7626.     This  may 
be  treated  as  a  normal  deviate;  that  is,  the  required  probability  is  given  by 


If" 

==/      e-Wdt  (12) 

2irJ     Xc 


V2 

Referring  to  a  table  of  the  normal  probability  integral,*  for  example 
Table  1.5  of  Chap.  1,  we  find  the  probability  to  be  0.2228.  This  com- 
pares favorably  with  the  exact  probability  of  0.2244  found  in  Sec.  2.1. 

More  accurate  values  of  Xc  corresponding  to  the  probabilities  0.025 
and  0.005  than  are  implied  by  (12)  are  given  in  Table  VIII,  "  Tests 
of  Significance  for  2X2  Contingency  Tables/'  in  Fisher  and  Yates, 
Statistical  Tables.  Detailed  instructions  and  illustrations  for  the  use  of 
this  table  are  given  in  its  introduction. 

When  the  percentage  of  successes  is  small  and  the  number  of  trials 
in  each  series  large,  as  is  the  case  in  the  examples  discussed  here,  the 
successes  may  be  analyzed  by  the  binomial  distribution,  along  the  lines 
discussed  in  Sec.  1  and  the  second  paragraph  of  Sec.  2.1.f  The  value  of 
P  for  the  binomial  test  is  the  proportion  of  all  trials  that  fall  into,  say, 
the  standard  series.  In  the  illustration  of  Table  7.2,  P  =  }^.  Accord- 
ing to  the  binomial  distribution,  the  probability  that  seven  trials  of  an 
event  having  two  equally  likely  outcomes  will  show  as  many  as  five  or 
more  of  a  designated  type  (in  other  words,  the  probability  that  with 
seven  successes  observed  five  or  more  will  occur  in  the  experimental 
series)  is 

l/7'  71  7'  \ 

V  \2W\  +  T!6!  +  OTtV  =  °-2266  (13) 

This  conforms  satisfactorily  to  the  exact  value  0.2244  found  in  Sec.  2.1. 

2.3.    Alternative  Methods 

A  procedure  that  has  been  widely  used  for  testing  the  significance 
of  the  difference  between  two  proportions  is  based  on  the  following 
considerations:  The  difference  between  the  two  percentages  is  divided 
by  the  standard  error  of  the  difference,  and  the  ratio  is  regarded  as 
normally  distributed.  The  standard  error  of  the  difference  is  computed 
from  the  familiar  relation  that  the  standard  error  of  the  difference 
between  two  independent  statistical  variables  is  the  square  root  of  the 
sum  of  the  squared  standard  deviations  of  the  two  variables.  The 
standard  error  of  a  proportion  p  is  well  known  to  be  \/P{l  —  P)/N, 

*  For  a  detailed  table,  see  the  tables  prepared  by  the  New  York  Mathematical 
Tables  Project. 

t  See  Przyborowski  and  Wilenski,  "  Homogeneity  of  Results  in  Testing  Samples 
from  Poisson  Series,"  especially  Sec.  2,  Derivation  of  Test,  pp.  314-317. 
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where  P  is  the  true  population  value  of  the  proportion  and  N  is  the 
number  of  trials. 

Applying  these  considerations,  it  is  often  concluded  that 

v  = .    (/F(f-p;i     .  d4) 

VPs(l   —  Ps)  +  Pe{1   —  Pe) 

where  pE  is  the  observed  proportion  for  the  experimental  method 
ps  is  the  observed  proportion  for  the  standard  method 
N  is  the  number  of  trials  with  each  method 
may  be  treated  as  normally  distributed  with  mean  of  0  and  standard 
deviation  of  1.     If  the  probability  of  obtaining  a  normal  deviate  as 
large  as  V  is  small,  it  is  concluded  that  the  difference  in  the  proportions 
is  greater  than  can  be  ascribed  to  chance. 

There  are  two  serious  objections  to  the  procedure  by  which  expres- 
sion (14)  is  derived:  (1)  V  will  not  actually  be  distributed  as  assumed; 
and  (2)  the  procedure  is  not  self-consistent. 

With  regard  to  the  distribution  of  V,  there  are  two  considerations: 
In  the  first  place,  the  true  population  standard  deviation  of  the  difference 
is  required  in  the  denominator  if  the  ratio  is  to  be  normally  distributed 
with  unit  variance  when  N  is  large;  but  actually  only  experimental 
values  are  available.  Hence,  chance  variations  affect  not  only  the 
numerator  but  also  the  denominator  of  V.  In  the  second  place,  even 
if  we  know  the  true  standard  deviation,  the  distribution  of  the  difference 
(Pe  —  ps)  cannot  be  satisfactorily  approximated  by  a  normal  distribution 
when  the  proportions  are  near  0  or  1,  unless  N  is  very  large. 

The  procedure  leading  to  expression  (14)  is  inconsistent  in  two 
respects.  It  sets  out  to  calculate  the  probability  that,  if  there  were 
actually  no  long-run  difference,  a  difference  as  great  as  that  observed 
would  occur  by  chance.  But  if  there  were  no  long-run  difference  in  the 
proportions,  there  would  be  none  in  the  standard  deviations.  Therefore, 
when  the  standard  deviation  is  estimated  from  the  experiment,  the  esti- 
mate should  be  based  on  the  two  series  together.  This  leads  to 
considering 

VN  (Pe  —  Ps) 


yjto  +  P-)(l    -  P~^) 


(15) 


as  a  normally  distributed  variable. 

The  second  inconsistency  in  expression  (14)  may  be  seen  as. follows: 
It  would  be  perfectly  logical  to  find  what  proportion  of  the  total  successes 
were  made  with  the  standard  method  and  what  proportion  of  the 
total  failures  were  made  with  the  experimental  method.  These  propor- 
tions should  be  equal  if  the  standard  and  experimental  methods  are 
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equally  effective.  But  if  the  significance  of  the  difference  of  this  pair  of 
proportions  is  tested  by  expression  (14),  the  result  is  not  the  same  as 
when  the  logically  equivalent  pair  based  on  methods  is  tested.  Expres- 
sion (15),  however,  is  consistent  in  this  respect  as  well  as  in  the  standard 
error  it  uses. 

Expression  (15)  is  mathematically  identical  with  formula  (11) 
in  the  previous  section,  except  that  formula  (11)  is  calculated  from  the 
data  corrected  for  continuity.  Thus,  in  terms  of  the  frequencies  of 
Table    7.3    (without   continuity    corrections),    expression    (15)    can   be 

written  as 

V"  +  b  +  c  +  d  (ad  —  be)  .     . 

*  ~  Via  +  b)(a  +  c)(b  +  d)(c~^dj 

with  a-}-b=c-\-d  =  N.  This  is  simply  the  square  root  of  expression 
(11),  neglecting  the  continuity  corrections.  When  the  continuity  cor- 
rections are  taken  into  account  and  N  is  introduced  for  simplification, 
expression  (16)  becomes 

ad  —  be  +  N 


V(N/$)(a  +  c)(b  +  d) 


(17) 


Note  that  N  in  expressions  (14),  (15),  (16),  and  (17)  represents  the 
number  of  trials  in  each  set.  Also  note  that  (17)  is  valid  only  when  the 
continuity  corrections  increase  a  and  d  and  decrease  b  and  c;  for  the 
reverse  situation,  -\-N  in  the  numerator  of  (17)  must  be  replaced  by  —  N. 

Expression  (14),  though  it  is  theoretically  inferior  to  expressions  (15) 
and  (17),  does  not  differ  materially  from  them  when  N  is  large.  Since 
(15)  and  (17)  are  theoretically  superior  and  are  little  or  no  more  trouble 
to  calculate,  they  are  always  preferable  to  (14).  Even  (15)  and  (17), 
however,  are  approximations;  they  fail  for  very  small  values  of  N  or 
very  small  probabilities  of  a  success,  and  in  such  cases  the  procedures 
outlined  above  in  Sees.  2.1  and  2.2  must  be  used.  The  procedure  of  Sees. 
2.1  and  2.2  should  be  used  also  when  a  +  b  is  different  from  c  +  d. 

For  practical  work  with  double  dichotomies,  the  Zubin  Item  Analyzer 
gives  results  essentially  equivalent  to  those  of  expression  (14).*  In 
this  device  the  difficulties  that  (1)  proportions  are  not  normally  dis- 
tributed when  the  true  proportion  is  small,  and  (2)  the  standard  devia- 
tion depends  upon  the  true  proportion,  are  avoided  by  employing  a  new 
variable  9,  which  is  a  function  of  p.  The  function  is  chosen  so  that,  first, 
0  is  more  nearly  normally  distributed  than  p,  and  second,  the  standard 
deviation  of  6  is  independent  of  the  true  long-run  value  of  6  or  p.     A 

*  For  a  description  of  this  device,  see  Fulcher  and  Zubin,  "The  Item  Analyzer." 
A  model  constructed  by  Dr.  Fulcher  was  lent  to  the  Statistical  Research  Group  for 
trial  in  1943.  It  proved  quick  and  easy  to  use  and  would  be  cheap  to  build,  as  it 
involves  only  plywood  and  celluloid. 
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function  that  meets  these  requirements  when  N  is  not  too  small  is 

0  =  2  arcsin  ^  (18) 

Unless  N  is  very  small,  6  may  be  regarded  as  normally  distributed;  the 
standard  deviation  is  l/\/N  when  6  is  measured  in  radians  or  57.3/-\/N 
when  0  is  measured  in  degrees.  The  Zubin  Analyzer  is  a  mechanical 
device  for  transforming  the  proportions  to  the  proper  angles  and  reading 
directly  the  difference  (6E  —  0s).  The  required  probability  can  then 
be  read  from  a  simple  chart  showing  the  difference  and  its  standard  error, 

3.  NECESSARY  NUMBER  OF  TRIALS 
A  method  that  is  sometimes  recommended  for  finding  the  required 
sample  size  in  experiments  involving  double  dichotomies  is  to  set  V  in 
expression  (14)  equal  to  the  value  Va,  which  would  be  exceeded  only  a 
percent  of  the  time  if  there  were  no  true  differences  in  the  proportions, 
and  then  solve  for  N,  using  values  of  ps  and  pE  obtained  experimentally. 
The  result  is 

N  = (19; 

(pE  -  ps)2 

This  procedure  can  be  made  logically  consistent  by  basing  the  calcula- 
tions on  expression  (15)  instead  of  on  (14),  in  which  case  the  formula 
becomes 

K2a(ps  +  Pm\  (  1  -  ~ ) 

N  =  ^ f-  (20) 

(Pe  -  Ps)2 

where  Ka  is  the  normal  deviate  exceeded  with  probability  a,  as  given  in 
Table  1.1  of  Chap.  l.| 

In  general,  (20)  solves  correctly  the  following  problem:  If  the  propor- 
tion of  successes  is  ps  for  the  standard  and  pE  for  the  experimental 
method,  how  large  must  be  the  number  of  trials  from  which  these  propor- 
tions were  computed  before  the  experiment  can  be  regarded  as  indicative 
of  a  real  difference?  The  question  stated  is,  however,  of  little  or  no 
practical  significance.  The  practical  problem  exists  only  because  it 
cannot  be  assumed  that  the  proportions  shown  in  the  experiment  are 

*  See  Chap.  16  for  an  account  of  the  inverse  sine  transformation  for  percentages. 

f  The  value  of  Kl  can  be  found  from  tables  of  the  chi-square  distribution,  for 
example,  Table  IV  in  Fisher  and  Yates," Statistical  Tables,  thus  eliminating  the  need  of 
squaring  Ka.  Since  the  probability  that  x2  will  exceed  a  certain  level  is  the  probability 
that  x  will  exceed  the  square  root  of  that  level  plus  the  probability  that  —  x  will  exceed 
the  square  root  of  that  level,  the  probabilities  given  in  the  x2  tables  must  be  halved 
for  the  present  purposes. 
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the  true  proportions.  If  they  were,  we  should  be  able  to  decide  without 
difficulty  which  method  is  superior.  Furthermore,  even  if  we  were 
willing  to  accept  the  proportions  ps  and  pE  shown  in  the  experiment  as 
suitable  approximations  of  the  true  long-run  proportions  Ps  and  PE,  we 
could  compute  N  from  (20)  only  after  carrying  out  the  experiment  to 
find  ps  and  pE. 

To  determine  a  useful  formula  for  N,  we  must  think  of  formula 
(11)  or  its  equivalent,  formula  (15),  as  constituting  a  rule  according 
to  which  we  decide  from  the  data  which  of  two  actions  to  take.  If  the 
formula  yields  a  value  above  some  predetermined  critical  level,  we  accept 
the  experimental  method  as  superior,  and  if  it  yields  a  lower  value  we  do 
not  accept  it  (that  is,  for  practical  purposes  we  reject  it).  If  the  true 
proportion  of  successes  for  the  experimental  method  is  the  same  as  the 
true  proportion  of  successes  for  the  standard  method,  it  is  important 
that  the  critical  level  of  %  be  such  that  there  is  only  a  small  probability, 
say  a,  that  the  data  will  lead  to  a  value  of  %  above  the  critical  level. 
Similarly,  if  the  true  proportion  of  successes  is  Ps  for  the  standard 
method  but  PE  (PE  >  Ps)  for  the  experimental  method,  it  is  important 
that  the  critical  level  of  x  and  the  sample  size  N  be  such  that  there  is 
only  a  small  probability,  say  /3,  that  the  data  will  lead  to  a  value  of  x 
below  the  critical  level. 

The  first  condition,  that  the  probability  be  only  a  of  accepting  the 
experimental  method  as  superior  when  the  true  probabilities  of  success 
are  equal,  is  met  simply  by  selecting  the  critical  level  of  x  as  Ka,  the 
normal  deviate  exceeded  with  probability  a.  Then,  regardless  of  the 
sample  size,  the  probability  is  a  that  the  experimental  method  will  be 
accepted  when  the  true  probabilities  of  success  are  the  same  for  both 
methods. 

On  the  other  hand,  for  small  samples  the  probability  that  x  will 
exceed  Ka  will  not  be  much  more  than  a  even  when  the  true  probabilities 
are  Ps  and  PE.  This  probability  of  accepting  the  experimental  method 
when  the  true  probabilities  are  Ps  and  PE  will,  however,  increase  as  the 
sample  size  increases ;  and  the  problem  is  to  find  that  value  of  N  for  which 
it  is  1  -  p. 

The  required  value  of  N,  the  size  of  the  sample  to  be  used  with 
each  method,  is  given  by 

N  =  I  ( ; 5-°  +  g* yJ]2  (21) 

z  \arcsm  \/Ps  —  arcsm  -\/PE) 
if  angles  are  measured  in  radians,  or  by 

N  =  1641.6  ( : J^  +  K\ =,Y  (22) 

\arcsin  -yPs  —  arc  sin  \/PeJ 
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if  angles  are  measured  in  degrees,  where  Ke  is  denned  by 

-^\\-^dx  =  ,  (23) 

a  is  the  probability  of  an  error  of  the  "first  kind,"  that  is,  of  accepting 
the  experimental  method  as  superior  when  actually  the  probability 
of  success  is  the  same  for  both  methods 

(3  is  the  probability  of  an  error  of  the  " second  kind,"  that  is,  of 
rejecting  the  experimental  method  when  the  true  probabilities 
for  the  standard  and  experimental  methods  are  Ps  and  PE, 
respectively.* 

Values  of  N  can  be  read  directly  from  the  nomogram  of  Fig.  7.1. 

The  choice  of  Ps  can  usually  be  based  upon  previous  experience  with 
the  standard  method,  which  will  indicate  approximately  its  proportion 
of  successes.  The  value  of  PE  may  then  be  chosen  enough  above  Ps  so 
that  the  difference  is  one  that  it  is  important  to  detect  when  present. 
Moderate  variations  in  the  values  of  Ps  and  PE  are  not  serious  if  Ps  and  PE 
are  affected  alike.  If  the  angular  difference,  the  square  of  which  con- 
stitutes the  denominator  of  formula  (22),  is  accepted  as  the  appropriate 
measure  of  the  difference  between  two  percentages,  it  is  not  necessary 
to  choose  Ps  and  PE  individually  but  only  to  determine  the  angular 
difference  that  should  be  detected,  f 

To  illustrate  the  application  of  formula  (21),  suppose  Ps  =  0.010, 
PE  =  0.025,  a  =  0.05,  and  0  =  0.10.  From  Table  1.1  of  Chap.  1,  we 
find  Ka  —  1.645  and  Kp  =  1.282.  From  trigonometric  tables,  J  we  find 
arcsin  0.1000  and  arcsin  0.1581  to  be,  respectively,  0.1002  and  0.1588 
radians.     Substituting  these  figures  into  (18),  we  have 

=      (1.645  +  1.282)' 

2(0.1588  -  0.1002)2  y     J 

=  1,246 

*  If  only  one  method  is  involved  in  an  experiment,  and  if  it  is  desired  that  the 
probability  of  approving  it  be  a  or  1  —  /3  according  as  its  true  probability  of  success  is 
Ps  or  PE,  formula  (21)  will  indicate  twice  the  appropriate  sample  size;  that  is,  the 
appropriate  sample  size  is  given  by  substituting  4  for  2  in  the  denominator  of  formula 
(21).  This  was  the  method  of  computing  the  sample  sizes  Na  given  in  Table  1.3  of 
Chap.  1  for  acceptance-inspection  plans  based  on  attributes.  For  that  application, 
the  first  sentence  of  this  footnote  should  be  changed  to  read  as  follows:  If  only  one 
product  is  involved  in  an  inspection  plan,  and  if  it  is  desired  that  the  probability  of 
rejecting  be  a  or  1  —  /3  according  as  the  true  probability  of  a  defective  is  px  or  p2, 
formula  (21)  will  indicate  twice  the  appropriate  sample  size. 

f  In  this  connection,  see  Statistical  Research  Group,  Sequential  Analysis,  Sec. 
3.62,  pp.  3.47-3.50,  especially  Table  3.61  and  Fig.  3.61;  also  Sees.  3.21  and  3.22,  pp. 
3.09-3.10. 

%  For  example,  Mathematical  Tables  Project,  Arc  sin  x. 
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If  the  nomogram  is  used,  we  pass  a  line  between  0.05  on  the  a  scale  and 
0.10  on  the  /3  scale  and  mark  its  point  of  intersection  with  the  A  scale. 
Another  line  is  passed  between  0.01  on  the  Ps  scale  and  0.025  on  the  PE 
scale,  and  the  point  of  intersection  of  this  line  with  the  B  scale  is  marked. 
The  N  scale  is  calibrated  so  that  the  number  of  trials  needed  for  each 
method  can  be  read  off  where  the  N  scale  is  intersected  by  a  straight 
line  between  the  points  marked  on  the  A  and  B  scales.  In  this  example, 
the  point  of  intersection  falls  about  halfway  between  N  =  1,200  and 
N  =  1,300. 

4.  OPERATING  CHARACTERISTICS 

Conclusions  drawn  from  double  dichotomies  depend  upon  two  quanti- 
ties, one  characterizing  the  amount  of  data  collected  and  the  other 
characterizing  the  method  of  analyzing  the  data.  These  quantities 
are  N,  the  number  of  observations  with  each  method,  and  Ka,  the  critical 
level  of  x  which,  if  equaled  or  exceeded,  leads  to  acceptance  of  the 
experimental  method  as  superior.  Formula  (21)  can  be  used  to  deter- 
mine the  operating  characteristics  of  any  plan  for  analyzing  a  double 
dichotomy  denned  by  given  values  of  N  and  Ka. 

By  the  operating  characteristics  of  a  plan  is  meant  the  probability, 
as  a  function  of  the  true  nature  of  the  populations,  that  the  plan  will 
lead  to  accepting  the  experimental  method  as  superior.  In  this  case,  the 
operating  characteristics  may  be  represented  by  a  surface  rather  than  a 
curve,  for  the  true  nature  of  the  populations  is  measured  by  two  param- 
eters Ps  and  PE.  If  in  the  plane  of  the  page  wTe  measure  PE  lengthwise 
and  Ps  crosswise,  any  point  for  which  PE  and  Ps  are  both  between  0  and  1 
represents  a  possible  combination  of  PE  and  Ps.  Vertically  above  each 
point  we  may  erect  a  line  whose  height  represents  the  probability  that  a 
test  using  the  given  N  and  Ka  will  lead  to  accepting  the  experimental 
method  as  superior.  The  surface  formed  by  the  tops  of  these  lines  is  the 
OC  surface.  A  test  is  satisfactory  if  its  OC  surface  shows  high  probabili- 
ties of  accepting  the  experimental  method  when  its  superiority  is  suffi- 
cient to  make  such  acceptance  important,  and  low  probabilities  of  accept- 
ing the  experimental  when  its  superiority  is  so  small  (especially  when  it 
is  zero  or  negative)  as  to  make  its  acceptance  undesirable. 

We  may  reAvrite  (21)  as 

-Kp  =  Ka  -  \/W  (arcsin  ^/Pe  -  arcsin  VF7)  (25) 

Formula  (25)  yields  —Kp;  the  probability  of  accepting  the  experimental 
method  is  1  —  (3,  which  is  the  probability  that  a  normal  deviate  will 
exceed  —  Kp  and  may  be  obtained  from  Table  1.5  of  Chap.  1.  Thus,  for 
any  combination  of  PE  and  Ps,  formula  (25)  yields  the  probability  that  a 
test  based  on  N  and  Ka  will  lead  to  acceptance  of  the  experimental 
method. 
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Formula   (25)    does  not  depend  upon  the  individual  probabilities 
Pe  and  Ps  but  only  upon  their  angular  difference 


so  we  may  write  it 


T  =  arcsin  \/Pe  —  arcsin 


Kp  =  V2N  T  -  Kc 


(26) 


(27) 


Letting  LT  denote  the  probability  that  a  test  defined  by  N  and  Ka  will 
lead  to  acceptance  of  the  experimental  method  when  the  true  angular 
difference  is  T,  and  noting  that  Ki-p  =  —Kp,  we  have 


V2tt  JKc 


e    2  dx 


V2N  T 


(28) 


as  the  equation  of  the  OC  curve  in  terms  of  T.     Values  of  LT  can  be 
obtained  from  Table  1.5,  Chap.  1,  by  taking 


Ke  =  Ka-  V2N  T 


(29) 


Fig.   7.2. — Operating-characteristic  curve  for  a  comparison  of  two  percentages,  JV   =  500, 

Ka   =  1.645. 
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Figure  7.1  can  be  used  to  determine  LT  from  (28).  The  given  values 
of  PE  and  Ps  determine  a  point  on  the  B  scale.  A  line  is  drawn  through 
the  point  on  the  B  scale  and  the  given  value  of  N  on  the  N  scale  and 
extended  until  it  intersects  the  A  scale.  This  point  on  the  A  scale  and 
the  given  value  of  a  on  the  a  scale  determine  a  line  whose  intersection 
with  the  |8  scale  gives  1  —  Lt. 

In  evaluating  a  proposed  test,  it  is  sometimes  helpful  to  plot  the  OC 
curve  against  the  true  proportion  of  successes  for  the  experimental 
method,  assuming  some  value,  say  P's,  for  the  proportion  of  successes  with 
the  standard  method.  This  gives  a  picture,  the  significance  of  which 
depends  on  the  validity  of  the  value  used  for  P'8J  of  the  absolute  level 
of  performance  by  the  experimental  method  that  will  result  in  a  high 
probability  of  its  acceptance  or  rejection.  Such  a  picture  can  be  obtained 
from  formula  (25),  replacing  Ps  by  P's  and  calculating  LT  =  1  —  ft 
for  various  values  of  PE. 

Figure  7.2  shows  the  OC  curve  for  an  analysis  based  on  a  sample  size 
N  of  500  for  each  method  and  a  critical  level  of  x,  Ka,  of  1.645.  Two 
horizontal  scales  are  shown,  the  lower  in  terms  of  T,  the  true  angular 
difference  between  the  two  probabilities,  and  the  upper  in  terms  of  PE 
on  the  assumption  that  Ps  =  0.10. 

5.  TECHNICAL  NOTES 

Let  sE  and  ss  be  the  observed  numbers  of  successes  in  N  trials  with 
the  experimental  and  standard  methods,  respectively.  Then  the 
observed  proportions  of  successes  with  the  two  methods  are 

Pe  =  ^         and         ps  =  -^  (30) 

When  we  are  interested  only  in  alternative  hypotheses  for  which 
PE,  the  true  proportion  of  successes  with  the  experimental  method,  is 
greater  than  Ps,  the  long-run  proportion  of  successes  with  the  standard 
method,  the  classical  test  with  level  of  significance  a  is  to  reject  the  null 
hypothesis  that  PE  =  Ps  if 

\/N(pE  —  ps) 


=  >Ka  (31) 

where 


— =  /      e    2  dx  =  a  (32) 

Except  when  N  is  very  small,  or  PE  or  Ps  very  close  to  0  or  1,  this 
test  is  essentially  equivalent  to  the  criterion :  Reject  the  null  hypothesis  if* 

y/2N  (arcsin  y/pE  —  arcsin  y/ps)  >  Ka  (33) 

*  See  Chap.  16  for  a  discussion  of  the  inverse  sine  transformation  of  percentages, 
upon  which  what  follows  is  based. 
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because  the  quantity 

t  =  arcsin  \/pE  —  arcsin  y/ps  (34) 

may  be  regarded  as  normally  distributed  with  mean  of 

T  =  arcsin  s/Pe  —  arcsin  \/P~s  (35) 

and  variance  l/2iV,  whence  it  follows  that  when  the  null  hypothesis  is 
true,  y/2N  t  is  normally  distributed  about  zero  with  unit  standard 
deviation. 

Noav,  to  find  N  so  that  the  probability  is  (3  of  accepting  the  null 
hypothesis  when  T  =  Tf,  it  is  necessary  that 

Prob  [\/2N  (arcsin  Vp^  -  arcsin  \Zp~s)  <  Ka\T']  =  p         (36) 
This  is  equivalent  to  requiring  that 

1  rKa-V2NT'      _£f 

vrj_„       *■'*-'  (37) 

which  becomes 

Ka  -  V2N  Tf  =  -Kp  (38) 

where  K0  is  defined  by  replacing  a  by  /3  in  equation  (32).  Solving  for  N 
gives 

or 

1  /  Ka  +  Kp  \2 

N  =  o  ( : — 7w • — 7w )  (4°) 

2  \arcsm  y/Pis  —  arcsin  y/P&) 

It  may  be  observed  that  formula   (40)   follows  immediately  from 
formula  (148)  in  Chap.  1,  Sec.  11.1.3.     We  have 


Zi  =  0 

(41) 

Z2  =  arcsin  \/PE  - 

-  arcsin  y/Ps 

(42) 

a  =  1 

(43) 

v)-  L 

(44) 

Substituting  into  equation  (148)  of  Chap.  1,  we  have 
1  arcsin  \/P~e  —  arcsin  \/Pi 


V2F  K„  +  Kf, 


(45) 


which,  when  solved  for  N,  leads  to  (40).  Similarly,  equation  (147)  of 
Chap.  1  leads  to  a  critical  level  A  of  Ka/\/2N  for  arcsin  \/pE  —  arcsin 
\/ps,  which  is  equivalent  to  (33). 
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NUMBER  OF  CASES  REQUIRED  FOR  COMPARING  TWO  PERCENTAGES 

Directions 

1.  Select  values  of  a,  p\  Ps,  and  PE  (See  "Explanation"). 

2.  Locate  the  value  of  at  on  the  scale  marked  "a." 

3.  Locate  the  value  of  /3  on  the  scale  marked  "/3." 

4.  Locate  the  point  at  which  a  straight  line  from  a.  to  /3  cuts  the  scale  marked  "A." 

5.  Locate  the  value  of  Ps  on  the  scale  marked  "Ps."  If  Ps  exceeds  0.50  use  1  -  Ps 
and  make  a  corresponding  change  of  Pe  to  1   —  Pe- 

6.  Locate  the  value  of  Pe  on  the  scale  marked  "Pe." 

7.  Locate  the  point  at  which  a  straight  line  from  Ps  to  Pe  cuts  the  scale  marked  "B." 

8.  Locate  the  point  at  which  the  scale  marked  "N"  is  cut  by  a  straight  line  from  the 
point  on  "B"  to  the  point  on  "A."  The  "N"  scale  is  calibrated  so  that  the  required 
sample  size  N  is  given  opposite  this  point. 


H 1 — i     '    i    i  i  I 


|nn|M.,H""l 


Call  the  two  percentages  being  compared  the  "standard"  and  1 


'experimental.' 


a  is  defined  as  foUows:  We  will  conclude  that  the  experimental  method  is  better  only  if  it  shows  in  the  experiment  a  superiority  so  great 
that  the  probability  of  this  much  superiority  would  be  only  a  if  the  true  long-run  percentages  were  equal.  In  other  words,  a  is  the 
probability  that  we  will  erroneously  judge  the  experimental  method  better  when  in  fact  the  two  methods  are  equally  good. 
/3  is  the  probability  that  we  will  fail  to  judge  the  experimental  method  better  when  in  fact  it  is  better.  In  other  words,  it  is  the  prob- 
ability that  the  experimental  method,  although  actually  better  in  the  long  run,  will  show  a  smaller  superiority  in  the  experiment  than 
is  required  by  the  value  of  a  for  concluding  that  it  is  better. 
Ps  is  an  estimate  of  the  true  long-run  probability  for  the  standard  method.     This  estimate  can  be  based  on  prior  experience,   scientific 

judgment,  etc. 
Pe  is  an  estimate  of  the  true  long-run  probability  for  the  experimental  method.     This  estimate  may  be  determined  like  that  for  Ps;  or  it 
may  be  the  smallest  figure  above  Pg  for  which  it  would  be  important  to  detect  the  superiority.     For  example,  if  Ps  =  0.10,  and  if  it 
would  be  important  to  detect  any  superiority  of  more  than  50  percent,  we  set  Pe  =  0.15. 

■Ka 
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.030-; 


rV  is  the  number  of  trials  with  each  method  required  in  the  experiment,  as  given  by  the  formula 


2  Va 


/,: 


VpI 
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i  measured  in  radians. 
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CHAPTER  8 

PLANNING  AND  INTERPRETING  EXPERIMENTS 
FOR  COMPARING  TWO  STANDARD  DEVIATIONS 

1.  INTRODUCTION 

The  present  chapter  is  concerned  with  the  planning  and  interpretation 
of  experiments  to  determine  whether  (1)  a  given  product,  or  process, 
is  more  (or  less)  variable  than  a  standard  product,  or  process;  (2)  one 
of  twTo  new  products,  or  processes,  is  more  (or  less)  variable  than  the 
other;  (3)  the  random  variation  of  a  product,  or  process,  has  a  component 
assignable  to  a  specific  source  of  variation. 

With  regard  to  proposed  experiments,  the  problem  considered  is 
that  of  determining  how  many  observations  will  be  required  in  order  to 
render  the  discriminating  power  of  the  experiment  adequate  for  the 
purposes  at  hand.  The  solutions  to  this  problem,  in  addition  to  being 
useful  in  the  planning  of  experiments,  provide  a  means  of  evaluating  the 
discriminating  power  of  an  experiment  already  carried  out.  Thus,  in 
planning  experiments  for  the  three  purposes  listed  above,  the  results 
given  below  will  enable  a  research  worker  to  formulate  an  experiment  the 
size  of  which,  on  the  one  hand,  is  compatible  with  the  cost  (monetary 
expense,  time,  inconvenience,  and  so  forth)  of  obtaining  the  observa- 
tions and,  on  the  other,  limits  to  a  known  extent  the  risks  of  making 
erroneous  decisions  of  practical  significance.  Alternatively,  in  the  case  of 
experiments  already  conducted,  the  results  given  below  will  enable  the 
experimenter  to  judge  the  confidence  that  he  can  place  in  conclusions 
based  on  the  data  obtained. 

2.  EXPERIMENTS  TO  DETERMINE  WHETHER  A  NEW  PRODUCT 
IS  MORE  VARIABLE  THAN  A  STANDARD  PRODUCT 

2.1.  Problem 

The  following  situation  is  common  in  practical  work :  A  certain  product 
has  been  in  use  for  some  time  and  its  variability  with  respect  to  some 
important  measurable  characteristic  x  is  known.  Let  u\  denote  the 
known  variance  (squared  standard  deviation)  of  x  for  this  standard 
product.  A  new  product  is  under  consideration,  whose  variability  with 
respect  to  x  is  unknown;  let  erf  denote  the  unknown  variance  of  x  for  the 
new  product.     Since  the  value  of  a{  is  unknown,  it  is  not  possible  to 
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decide  with  certainty  whether  the  variability  of  the  new  product  is  more 
than  that  of  the  standard,  that  is,  whether  u\/a\  >  1. 

2.2.  General  Procedure  for  Basing  an  Answer  on  Observational  Data 

An  answer  to  this  question  can,  however,  be  obtained  by  computing 
an  estimate  s\  of  <j\  from  values  of  x  observed  in  a  sample  of  the  new 
product  and  agreeing  that  sf/o-Q  >  C  will  be  taken  to  imply  that  (t\/u\  >  1, 
and  s\/ul  <  C  will  be  taken  to  imply  that  <j\/<j\  <  1,  where  C  is  some 
positive  constant  chosen  in  advance,  that  is,  before  the  value  of  sf  has 
been  obtained,  preferably  before  the  experiment  determining  the  value 
of  sf  is  carried  out.  It  is  evident  that  if  C  is  very  large,  application 
of  this  procedure  will  almost  always  lead  to  the  decision  that  the  new 
product  is  not  more  variable  than  the  standard  (a\  <  o-2,);  if  C  is  very 
small,  the  decision  will  almost  always  be  that  the  new  is  more  variable 
than  the  standard  (<r|  >  erg).  Clearly,  some  intermediate  value  of  C  is 
called  for — a  value  that  will  not  lead  too  frequently  to  the  decision 
<rf/o-o  >  1  when  the  reverse  is  true  and  yet  will  lead  to  this  decision  in 
most  of  the  cases  in  which  the  excess  of  g\  over  o~l  is  of  practical 
importance. 

2.3.  Standard  Procedure* 

The  standard  procedure  is  to  use  as  an  estimate  of  <j\  a  function  s\ 
of  the  observed  values  of  x  that  has  the  property  that  in  repeated  sampling 
n\&\/<r\  is  distributed,  at  least  to  a  good  approximation,  as  x2  for  n 
degrees  of  freedom,  and  to 

a.  Conclude  a\/u\  <  1  whenever 

4  <  ^  (i) 

b.  Conclude  <t\I<j\  >  1  whenever 

4>^  (2) 

where  x«(^i)  is  the  a  level  of  significance  of  x2  for  Wi  degrees  of  freedom, 
that  is 


for  n  =  Wi.f 


*  See  Fisher,  Statistical  Methods,  Chap.  IV,  Sec.  21.03.   (For  full  citations,  see 
Sec.  7). 

f  See  Sec.  6,  below,  for  a  discussion  of  published  tables  of  this  integral. 
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Example  I.  The  variance  o-jj  of  a  particular  dimension  of  a 
standard  product  is  3.2  cm.2  From  measurements  of  the  dimension 
on  10  items  of  a  new  product  it  is  found  that  s\  =  4.8,  with  9  degrees 
of  freedom.  Does  the  variance  g\  of  the  dimension  with  the  new 
product  exceed  the  variance  o-2,  of  the  dimension  with  the  standard 
product? 

If  we  conduct  the  test  at  the  0.05  level  of  significance,  we  find 
sl/4  =  1.5  <  1.880  =  Xo.05(9)/9  and  are  led  to  conclude  aj/al  <  1; 
that  is,  we  conclude  that  the  variability  of  the  new  product  does  not 
exceed  the  standard  in  respect  of  the  dimension  in  question. 

2.4.  Operating  Characteristics  of  the  Standard  Procedure 

The  question  arises,  what  are  the  chances  of  the  standard  rule  leading 
to  a  wrong  conclusion?  That  is,  when  in  fact  u\/al  <  1,  how  often  will 
it  lead  to  the  conclusion  (j\/crl  >  1,  termed  an  error  of  the  first  kind;* 
and  when  in  fact  o-\/<rl  >  1,  how  often  will  it  lead  to  the  conclusion 
o"i/o"o  <  1,  termed  an  error  of  the  second  kind?  In  other  words,  what 
are  the  operating  characteristics  of  the  procedure? 

Clearly,  the  chance  of  an  erroneous  decision  will  depend  on  cr2,, 
Wi,  and  a,  as  these  quantities  are  involved  explicitly  in  inequalities  (1) 
and  (2).  Intuition  tells  us  that  the  chance  of  error  depends  also  on 
the  actual  value  of  al/al,  that  is,  upon 

>-■$  <4> 

To  begin  with,  it  is  clear  from  (3)  that  the  probability  of  an  error  of 
the  first  kind  is  exactly  a  when  p  =  1.  Second,  it  will  be  shown  in 
Sec.  6  that  the  probability  of  an  error  of  the  second  kind  will  be 
exactly  (3  if  p  has  the  value 

p(a,/3,ni)  =     2         -  (5) 

This  equation  has  been  used  to  compute  tables  of  p(a,(3,ni)  corresponding 
to  a  =  0.05  and  0.01  and  to  a  wide  selection  of  values  of  j3  and  n\.  These 
are  published  as  Tables  8.1  and  8.2,  pp.  272-273  and  276-277;  the  method 
of  their  construction  and  the  accuracy  of  interpolation  is  discussed  in 
Sec.  6. 

The  following  examples  illustrate  the  use  of  Tables  8.1  and  8.2  in 
the  interpretation  of  the  results  of  experiments  already  carried  out 
and  in  the  planning  of  experiments: 

Example  II.     For  what  value  of  the  ratio  aj/al  is  the  probability 
of  an  error  of  the  second  kind  equal  to  0.50  when  the  test  of  Example 
I  is  employed? 
*  This  terminology  was  introduced  by  Neyman  and  Pearson. 
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For  a  =  0.05,  ni  =  9,  and  0  =  0.50,  we  find  from  Table  8.1 
that  p(0.05,0.50,9)  =  2.0280.  This  means  that  if  <rf  is  roughly 
double  a\  (that  is,  if  the  standard  deviation  of  the  new  product  is 
roughly  1.4  times  that  of  the  standard)  and  the  test  is  conducted  at 
the  0.05  level  of  significance  with  s\  based  on  only  9  degrees  of 

.OJT  Level  of  Significance 
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Fig.  8.1. — Operating-characteristic  curves  for  experiments  to  determine  whether  a 
new  product  (process)  is  more  variable  than  a  standard  product  (process)  of  known  vari- 
ability, 0.05  level  of  significance. 

freedom,  then  there  is  approximately  an  even  chance  that  the  test 
will  lead  to  the  wrong  conclusion,  o\/ol  <  1.  If,  to  ensure  a  smaller 
risk  of  committing  an  error  of  the  first  kind,  the  test  is  conducted 
at  the  0.01  level  of  significance,  the  situation  is  worse,  for 

p(0.01,0.50,9)  =  2.5970 
(from  Table  8.2).     Clearly  9  degrees  of  freedom  for  s{  are  too  few 
if  a  decision  that  <t\/<jI  <  1  is  a  serious  error  from  the  practical 
viewpoint  when  a\  is  double  cr§. 

Example  III.     If  a  decision  that  a\/a20  <  1   is  a  serious  error 
from  the  practical  viewpoint  when  cri  =  1.5o-0  (p  =  2.25),  and  it  is 
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desired  to  keep  the  risk  of  such  an  error  below  0.05  ((3  =  0.05) 
when  the  test  is  conducted  at  the  0.05  level  of  significance  (a  =  0.05), 
how  many  degrees  of  freedom  will  be  needed  for  sf? 

On  looking  in  the  /3  =  0.05  column  of  Table  8.1,  we  find  that 
m  =  30  is  not  quite  sufficient,  since  p(0.05,0.05,30)  =  2.3670;  but 

.01   Level  of  Significance 
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Fig.  8.2. — Operating-characteristic  curves  for  experiments  to  determine  whether  a  new 
product  (process)  is  more  variable  than  a  standard  product  (process)  of  known  variability, 
0.01  level  of  significance. 


ni  =  40  is  ample,  since  p(0.05,0.05,40)  =  2.1034.  By  interpolation, 
using  1/v^i  as  argument,  it  is  found  that  34  (33.7)  degrees  of 
freedom  are  needed  (linear  interpolation  gives  34.4,  which  would 
be  taken  as  35). 


If 


0-Q 


1.79   cm.    (al  =  3.2   cm.2)    as   in   Example   I,    so   that 


(7i  =  1.5(70  =  2.69  cm.,  and  it  is  felt  that  this  is  too  large  an  excess  to 
have  undetected  1  time  in  20,  but  failure  to  detect  that  o\  >  gq 
when  <7i  =  2.00  cm.,  say,  could  be  tolerated  1  time  in  20,  then 
p  =  1.25,  and  it  is  clear  from  an  examination  of  Table  8.1  that  a 
very  large  number  of  degrees  of  freedom,  over  400,  are  needed — 
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an  astonishingly  large  number  to  persons  who  have  never  investigated 
this  matter  before.* 

As  is  pointed  out  in  the  footnote  to  the  tables,  p(a,P,ni)  is  the  value 
of  c\/g%  for  which  the  probability  of  the  decision  a\/a\  <  1  equals  0  when 
the  test  is  conducted  at  the  a  level  of  significance  with  n\  degrees  of 
freedom.  When  p(a,P,ni)  <  1,  /?  can  no  longer  be  interpreted  as  the 
probability  of  an  error  of  the  second  kind,  since  the  decision  <j\/<j\  <  1 
is  correct  under  these  circumstances. \  The  values  of  p(a,(3,ni)  <  1  have 
been  included  in  the  tables  since  they  are  useful  for  drawing  the  operating- 
characteristic  curves. 

Let  Lp  =  L(a,rii,p)  denote  the  probability  that  the  decision  g\/g\  <  1 
will  be  made  when  g\/(j\  =  p  and  the  test  defined  by  formulas  (1)  and  (2) 
is  conducted  at  the  a  level  of  significance  with  ni  degrees  of  freedom 
for  s\.  Then  Lp  plotted  as  a  function  of  p,  for  fixed  values  of  a  and  nh 
is  termed  the  operating-characteristic  curve  of  the  test  so  defined.  Since 
the  probability  that  the  decision  a\la\  >  1  will  be  reached  is  1  —  Lp,  in 
view  of  the  exhaustive  character  of  inequalities  (1)  and  (2),  it  is  clear 
that  the  curve  Lp  completely  specifies  the  operating  characteristics  of  the 
test,  that  is,  the  expected  performance  of  the  test  in  repeated  applications. 
Thus,  it  is  not  necessary  to  plot  the  complementary  curve  1  —  Lp,  except 
perhaps  for  psychological  reasons.  A  probability  of  0.90  that  a  correct 
decision  will  be  made  may  evoke  less  concern  about  the  stringency  of  the 
test  than  a  statement  that  the  probability  of  an  incorrect  decision  is  0.10. 

Figure  8.1  shows  the  operating-characteristic  curves  correspond- 
ing to  5,  10,  15,  20,  30,  40,  60,  and  120  degrees  of  freedom  when 
a  =  0.05;  Fig.  8.2  shows  the  corresponding  curves  when  a  =  0.01. 

3.  EXPERIMENTS  TO  DETERMINE  WHETHER  A  NEW  PRODUCT 
IS  LESS  VARIABLE  THAN  A  STANDARD  PRODUCT 

3.1.  Problem 

There  are  not  many  situations  in  which  a  new  product  is  preferred  to 
a  standard  product  only  if  its  variability  exceeds  that  of  the  standard. 
Consequently,  a  test  of  the  hypothesis  a{/al  >  1  relative  to  the  alterna- 

*  The  following  formula  may  be  used  when  n  is  large,  say  more  than  30: 
n  -  1  (&«**  +^A2 

2    \  <Ti     —    (To  / 

m  i  (Ka  +K0y"py 

2  V      Vp  -  1      / 

where  K(  denotes  a  normal  deviate  exceeded  with  probability  t  and  may  be  found  from 
Table  1.1  of  Chap.  1. 

f  From  (5)  it  is  clear  that  p(a,p,ni)  =  1  for  (8  =  1  —  a,  and  is  less  than  1  for 
/3  >  1  —  a,  since  Xe  increases  as  e  decreases,  whatever  the  value  of  n. 
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tive  <j\I<j\  <  1  is  rarely  employed.  Nevertheless,  there  are  situations 
in  which  too  small  a  variability  is  disadvantageous.  For  instance,  an 
average  gunner  is  more  likely  to  bring  down  a  bird  with  a  shotgun  than 
with  a  rifle.  It  would  not  be  advisable  to  reduce  the  dispersion  of  the 
shot  to  nearly  zero. 

In  such  cases  the  standard  procedure  is  modified  to 

(a)  Conclude  <j\Io\  >  1  whenever 

~i  >  (o) 

<r0  tti 

(6)  Conclude  a\/a\  <  1  whenever 

i  <  ^^  (7) 

(To  tti 

In  connection  with  this  rule,  the  decision  o\/<j\  <  1  when  in  fact  (j\/g\  >  1 
is  an  error  of  the  first  kind,  whence  the  maximum  value  of  the  probability 
of  an  error  of  the  first  kind  is  a,  which  is  the  probability  of  the  decision 
°i/°o  <  1  when  <j\I<j\  =  1.  Accordingly,  the  statistical  test  defined 
by  (6)  and  (7)  is  said  to  be  conducted  at  the  a  level  of  significance, 
although  x!-a(wi)  is  the  100(1  —  a)  percentage  point  of  x2  for  n\  degrees 
of  freedom. 

The  value  of  pr  =  cf/o-J  for  which  the  probability  of  a  decision 
o"iAo  ^  1  equals  /?  when  the  test  specified  by  inequalities  (6)  and  (7)  is 
employed  is  readily  shown  to  be 

xLttOO 
= 1_ 

p(/3,a,ni) 

Consequently,  values  of  p1  are  found  by  interchanging  a  and  0  and  taking 
the  reciprocal  of  the  corresponding  p.     Thus 

p'(0.05,0.05,Wl)  -  p(Q  05)i -05)Bi)  (9) 

and 

p'(0.01,0.05,Ml)  =  p(a05;01wi)  (10) 

can  be  obtained  from  Table  8.1  by  taking  reciprocals  of  the  relevant 
entries;  and  values  of 


p'(«A»i)  =  ^==^  (8) 


and 


p'(0.01,0.01,n1)  -  p(0.01)0-01)Wi)  (ID 

^(0.05,0.01^0  =  p(aoi;-06     }  (12) 


can  be  obtained  in  similar  manner  from  Table  8.2. 
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3.2.  Illustration 

Example  IV.  The  scores  of  students  taking  a  standard  examina- 
tion have  a  standard  deviation  of  20  points  (<r0  =  20).  An  untried 
examination  will  be  acceptable  if  with  it  students'  scores  have  a 
standard  deviation  not  less  than  20  points  (crf/crg  >  1),  and  the 
authorities  are  willing  to  accept  a  risk  of  0.05  (a  =  0.05)  of  errone- 
ously rejecting  an  examination  for  which  uY  =  or0  =  20.  On 
the  other  hand,  acceptance  of  an  examination  for  which  ci  =  10 
(p'  =  M)  would  be  a  serious  error,  and  it  is  desired  to  hold  the  risk 
of  such  an  occurrence  below  0.01.  How  many  students  will  be 
needed  in  a  preliminary  try  out  of  the  examination? 

For  a  =  0.05  and  fi  =  0.01,  we  seek  the  value  of  n\  for  which 
p'(0.05,0.01,ni)  is  0.25,  that  is,  the  value  nx  for  which  p(0.01,0.05,7i!) 
is  4.     In  Table  8.2  we  see  that 

p(0.01,0.05,16)  =  4.0193  (13) 

p(0.01,0.05,17)  =  3.8526  (14) 

Accordingly,  n\  >  17  will  be  required,  which  means  that  the  smallest 
number  of  students  (wi  +  1)  that  will  suffice  is  18. 

4.  EXPERIMENTS  TO  DETERMINE  WHETHER  A  PARTICULAR  ONE 
OF  TWO  NEW  PRODUCTS  IS  MORE  VARIABLE  THAN  THE  OTHER 

4.1.  Problem 

When  the  variance  of  the  measurable  characteristic  is  unknown  for 
both  products,  as  it  generally  is  when  both  are  new,  the  rules  of  procedure 
given  in  Sees.  2  and  3  cannot  be  applied.  Let  a\  denote  the  true  variance 
of  the  measurable  characteristic  in  the  case  of  product  I,  let  a\  denote  the 
variance  in  the  case  of  product  II,  and  let  s\  and  s\  denote  estimates  of 
<j\  and  <j\  computed  from  measurements  on  I  and  II,  respectively.  A 
choice  between  the  conclusion  <j\/g\  <  1  and  the  conclusion  a\/u\  >  1 
can  then  be  based  on  the  observed  values  of  s{  and  sf,  for  example,  on 
the  observed  value  of  their  ratio  s\/s\.  That  is,  it  may  be  agreed  in 
advance  that  either  the  conclusion  a\/a\  >  1  or  the  conclusion  a\/u\  <  1 
will  be  accepted  according  as  s\/s\  exceeds  or  does  not  exceed  some 
number  C  chosen  in  advance. 

In  selecting  C  it  is  necessary  to  distinguish  between  the  following 
two  situations:  (1)  For  various  reasons  (for  example,  cost,  convenience, 
and  so  forth)  product  I  is  preferred  a  priori  and  should  be  chosen  unless 
a\  is  substantially  larger  than  a\.  (2)  Product  II  is  to  be  chosen  if 
<j\  >  a\,  but  it  is  immaterial  which  is  chosen  when  <r\  =  a\. 
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4.2.  Standard  Procedure  When  Product  I  Is  to  Be  Chosen  Unless  a\  Is 


Larger  Than  er 


4.2.1.  Procedure. — The  standard  procedure  in  case  (1)  is  to  compute 
from  the  data  estimates  sf  and  si  such  that  in  repeated  sampling  nisf/o-f 
and  n2s\l<j\  will  be  distributed  independently  as  x2  for  nx  and  n2  degrees 
of  freedom,  respectively,  and  to  base  the  choice  on  the  following  rule : 

(a)  Accept  product  I  whenever 

%  <  Fa(nhn2)  (15) 

(6)  Accept  product  II  whenever 

4  >  Fa(n1}n2)  (16) 

where  Fa(?ii,n2)  is  the  a  level  of  significance  of  the  variance  ratio  dis- 
tribution 

a  =  P{F(nhn2)  >  Fa(n1}n2)} 
fni  +  n2\ 

\  2   ; 


ni-2 

F   2    dF 


->(1  +  ?)^ 


(17) 


Since,  under  the  assumptions  regarding  the  distribution  of  s\  and  s2,, 
the  ratio  (sf/ 'al) / '{s\/ Vf)  is  distributed  as  F  for  n\  and  n2  degrees  of  freedom 
whatever  the  values  of  a\  and  <r|,  it  is  clear  from  (17)  that  the  probability 
of  (16)  occurring  when  a\  =  u\  is  exactly  a. 

Accordingly,  if  choosing  product  II  when  its  variability  is  not  less  than 
that  of  product  I  (that  is,  when  <j\/g\  <  1)  is  termed  an  error  of  the  first 
kind,  clearly  a  is  the  maximum  value  of  the  probability  of  an  error  of 
the  first  kind  when  the  standard  procedure  is  followed  and  the  distribu- 
tions of  si  and  s|  are  as  assumed.  The  test  is  thus  said  to  be  conducted 
at  the  a  level  of  significance. 

Let  the  true  value  of  the  ratio  (j\/<j\  be  denoted  by 

0  =  1  (18) 

a2 

Then  it  is  of  interest  to  determine  the  value  of  </>  for  which  the  probability 
is  ]8  that  product  I  will  be  chosen  by  application  of  (15)  and  (16).  The 
answer  is  readily  found  from  the  relation* 

*  Shortly  after  this  formula  was  derived,  a  British  document  was  received  in  which 
essentially  the  same  formula  is  given  and  attributed  to  D.  V.  Lindley;  see  Baines, 
Economical  Design, 
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=  Fa(nhn2)  •  Fp(n2,ni) 

Extensive  tables  of  <f>(a,l3,ni,n2)  corresponding  to  a  =  0.05  and  0.01  and 
to  a  range  of  values  of  |S,  rii,  and  n2  are  presented  as  Tables  8.3  and  8.4, 
pp.  284-295  and  298-309.  The  use  of  formula  (19)  in  the  computation 
of  these  tables  is  explained  in  Sec.  6,  which  also  includes  a  discussion  of 
methods  of  interpolation.  Figures  8.3  and  8.4  show  the  operating-charac- 
teristic curves  plotted  from  the  tables,  for  n\  —  n2  —  5,  10,  15,  20,  30,  40, 
60,  120. 

4.2.2.  Two  Special  Cases. — Before  illustrating  the  use  and  interpreta- 
tion of  Tables  8.3  and  8.4,  we  may  notice  two  important  special  cases 
"involving  ri\  =  n2: 

(a)  When  rii  =  n2  =  n  and  /3  =  0.50 

F0.b0(n,n)  =  1  (20) 

for  all  values  of  n,  whence 

<j>(a,0.50,n,n)  =  Fa(n,n)  (21) 

In  other  words,  when  the  degrees  of  freedom  for  s\  and  s\  are  equal,  the 
probability  of  an  error  of  the  second  kind  is  3^2  when  a\/a\  =  Fa(n,n), 
the  critical  ratio  employed  in  the  test.  Thus  Fa(n,ri)  may  be  termed  the 
indifference  value  of  <j\/(j2,  that  is,  the  value  of  v\la\  for  which  the  test 
is  equally  likely  to  select  either  product. 
(6)  When  rii  =  n2  =  n  and  /3  =  a 

cf>(a,a,n,n)  =  Fa(n,n)  (22) 

In  other  words,  when  the  degrees  of  freedom  for  sf  and  s|  are  equal,  the 
probability  of  an  error  of  the  second  kind  is  a  when  <j\ja2^  the  ratio  of  the 
true  standard  deviations,  equals  Fa(n,n),  the  critical  ratio  employed  in 
the  test.  That  is,  the  risk  of  an  error  of  the  second  kind  when  ai/a2  = 
Fa(n,n)  is  the  same  as  the  risk  of  an  error  of  the  first  kind  when  ai/<r2  =  1. 

4.2.3.  Illustrations 

*  Example  V.  A  choice  between  two  products  is  to  be  made,  the 
relative  variability  of  the  two  products  being  relevant  but  of  second- 
ary importance  in  the  sense  that  it  is  immaterial  which  is  chosen  if 
<t\/g\  =  2,  product  I  being  preferred  if  o\I<t\  <  2,  product  II  if 
<j\I<j\  >  2.  If  the  choice  is  to  be  based  on  (15)  and  (16),  using  the 
0.05  level  of  significance  (a  =  0.05),  and  if  sf  and  si  are  to  be  based  on 
an  equal  number  {rii  =  n2  =  n)  degrees  of  freedom,  how  many 
degrees  of  freedom  will  be  needed  for  each,  and  for  what  value  of 
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<j\/(j\  is  the  probability  0.95  that  product  II  will  be  chosen  (0  =  0.05)  ? 
From  the  first  special  case  of  Sec.  4.2.2,  we  have 

0(0.05,0. 50,n,n)  =  F0.0b(n,n), 

and  from  tables  of  the  F  distribution  we  find  this  to  be  1.9838  when 
n  =  24.  Thus,  if  24  degrees  of  freedom  are  used  for  each  product, 
the  probability  that  product  II  will  be  chosen  is  0.05  if  a\/a\  =  1,  0.50 
if  cj\I<j\  =  2,  and  (according  to  case  (b)  of  Sec  4.2.2)  0.95  if  <j\/(t\  =  4. 

Example  VI.  If  only  30  degrees  of  freedom  are  available  for  s\ 
and  if  a  test  is  conducted  at  the  0.05  level  of  significance,  how  many 
degrees  of  freedom  will  be  needed  for  s\  in  order  to  ensure  that  the 
probability  is  only  0.05  of  product  I  being  chosen  when  0-1/0-2  =  1.5? 

The  desired  stringency  cannot  be  achieved,  since  Table  8.3  shows 
0(O.O5,O.O5,3O,oo)  =  2.367,  which  exceeds  <j\/g\  =  2.25. 

4.3.  Procedure  When  It  Is  Immaterial  Which  Product  Is  Chosen  If 

o-i  =  A 

This  situation  arises  when  variability  is  the  primary  basis  for  choosing 
between  two  products.  The  following  rule  is  appropriate  to  this 
situation : 

(a)  Accept  product  I  whenever 


-\  <  Fo.,o(nhn2)  (23) 

S2 

(b)  Accept  product  II  whenever 

-2  >  Fo.6o(wi,w2)  (24) 


where  Fo.so(ni,ni)  is  the  median  of  the  F  distribution  for  n\  and  n2  degrees 
of  freedom.     Note  that  ^0.50(^1,^2)  ^  1  unless  nx  =  n2. 

By  setting  a  =  0.50  in  (19),  it  is  readily  seen  that  when  the  choice 
is  based  on  (23)  and  (24)  the  probability  is  $  that  product  I  will  be  chosen 
if 

a4  =  WO^ftfM.)  =  FF™fi>n*l  (25) 

Conversely,  the  probability  is  /3  that  product  II  will  be  chosen  when 

:i = ♦?-».1  -  ****  -  %£Sr  (26) 
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Since,  under  the  circumstances  assumed,  the  assignment  of  the 
designations  I  and  II  to  the  products  is  arbitrary,  it  is  natural  to  require 
that  (26)  be  the  reciprocal  of  (25) .  For  this  to  be  the  case,  it  is  necessary 
that 

F0(nhn2) 


Fl.bQ(nhn2)  =  Ff}(nhn2)  •  Fi-p(ni,n2) 


Fp(n2,ni) 


(27) 


L* 
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Fig.  8.3. — Operating-characteristic  curves  for  experiments  to  determine  whether  a 
particular  one  of  two  new  products  (processes)  is  more  variable  than  the  other,  0.05 
level  of  significance. 

It  is  clear  that  when  nx  =  n2,  equation  (27)  holds  for  every  value  of  0; 
it  is  believed  that  the  condition  n\  =  n2  is  necessary  for  the  equality 
to  hold.  Consequently,  if  symmetry  of  action  is  considered  desirable  in 
the  situation  considered  in  this  section,  the  test  should  be  carried  out 
with  the  same  number  of  degrees  of  freedom  for  s\  and  s\. 
Values  of 

<KO.5O,0,n,n)  =  Fp(n,n)  =  0(fcO.5O,n,n)  (28) 

can  be  obtained  from  Tables  8.3  and  8.4  by  means  of  (27).     For  example, 
0(0.50,0.05,30,30)  =  0(0.05,0.50,30,30)  =  1.841,  from  Table  8.3. 
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5.  EXPERIMENTS  INVOLVING  THE  ANALYSIS  OF  VARIANCE 
5.1.  Systematic  and  Random  Variation 

Many  experiments  are  analyzed  by  a  technique  (due  to  R.  A.  Fisher) 
known  as  the  analysis  of  variance.*     Tables  8.3  and  8.4  are  useful  in 


.01  Level  of  Significance 
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12 


Fig.  8.4. — Operating-characteristic  curves  for  experiments  to  determine  whether  a 
particular  one  of  two  new  products  (processes)  is  more  variable  than  the  other,  0.01  level 
of  significance. 


planning  a  certain  class  of.  such  experiments,  namely,  those  for  detect- 
ing and  measuring  components  of  random  variation. 

The  factors  associated  with  the  variation  of  a  product  can  be  divided 
into  two  categories: 

(1)  There  are  factors  that  introduce  random  variation.  Lack  of 
perfect  control  at  some  stage  of  a  production  process  often  acts  in  this 

*  Excellent  accounts  of  the  analysis  of  variance,  which  are  especially  pertinent  to 
this  chapter,  are  given  by  Baines,  Economical  Design,  and  Daniels,  "Components  of 
Variance." 
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manner,  and  the  material  produced  exhibits  an  inherent  random  vari- 
ability. This  random  variation  may  have  assignable  components.  For 
instance,  the  thickness  of  washers  punched  from  a  single  sheet  may  be 
regarded  as  varying  at  random  about  the  mean  thickness  of  washers  from 
that  sheet,  because  of  the  inherent  nonuniformity  of  the  sheet;  and 
washers  produced  during  a  given  shift  may  have  still  greater  variation 
about  the  mean  thickness  for  the  shift,  because  of  an  additional  compo- 
nent of  variation  assignable  to  the  random  variation  of  the  mean  thick- 
nesses of  the  respective  sheets  about  the  over-all  mean  thickness  of  the 
sheets  on  that  shift,  and  so  forth. 

(2)  There  are  factors  that  introduce  systematic  variation.  For 
instance,  in  a  production  unit  composed  of  several  similar  machines 
producing  the  same  product  simultaneously,  there  are  generally  sys- 
tematic differences  among  the  machines,  which  superpose  a  pattern  of 
variation  on  the  output  of  that  production  unit. 

In  using  analysis  of  variance  merely  to  determine  whether  the 
over-all  variation  has  a  component  assignable  to  a  particular  factor,  the 
computational  procedure  and  the  mechanics  of  the  statistical  test  are 
the  same  in  the  case  of  a  factor  that  may  contribute  a  component  of  random 
variation  as  in  the  case  of  a  factor  that  may  contribute  a  component  of 
systematic  variation.  On  the  other  hand,  in  planning  such  investiga- 
tions and  in  the  estimation  of  the  magnitude(s)  of  the  component (s), 
the  distinction  has  to  be  taken  into  account,  since  the  situations  between 
which  discrimination  is  required  differ  in  the  two  cases,  leading  to 
differences  in  the  computation  of  probabilities  of  errors  of  the  second 
kind.     We  consider  only  cases  where  random  variation  alone  is  present. 

The  question  arises  as  to  how  to  decide  in  a  given  case  whether  a 
component  of  variation  is  systematic  or  random.  The  answer  depends 
upon  the  extent  to  which  the  experimental  procedure  samples  the  material 
at  random.  If  a  sampling  operation  is  such  that  each  item  of  the 
population  has  an  equal  chance  of  being  included  in  the  sample,  and 
separate  drawings  are  independent  except  for  any  dependence  resulting 
from  the  number  of  items  in  the  population  being  limited,  then  the 
sampling  is  random  and  the  sample  may  be  termed  a  random  sample.* 
When  an  experimenter  selects  two  or  more  treatments  for  comparison 
in  some  respect,  he  rarely,  if  ever,  selects  them  at  random  from  a  popula- 
tion of  possible  treatments;  he  selects  those  be  believes  most  promising. 
Accordingly,  components  of  variation  assignable  to  differences  between 
treatments  are  systematic.     On  the  other  hand,  when  an  experimenter 

*  It  is  important  to  note  that  it  is  the  operation  of  drawing  the  sample  that  is 
random  and  not  the  sample  itself,  since  any  sample  whatsoever  can  be  obtained  by 
either  a  random  or  a  nonrandom  operation. 
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selects  a  sample  of  items  from  a  production  lot  (or  a  sample  of  animals 
from  a  herd)  for  a  study  of  the  effects  of  the  treatments,  he  can  ensure 
that  they  are  a  random  sample  from  the  lot,  or  herd,  by  introducing 
randomization  into  the  sampling  procedure,  for  example,  by  drawing 
cards  from  a  hat  after  thorough  mixing,  or  by  using  a  table  of  random 
numbers.  He  may  consider  such  a  sample  to  be  a  random  sample  from 
the  production  process  (or  the  animal  species)  only  by  making  the 
assumption  that  the  lot  (herd)  itself  is  a  random  sample  from  the  process 
(species).  When  a  particular  experiment  is  being  planned,  or  the  results 
of  a  completed  experiment  are  being  interpreted,  the  following  two 
parallel  sets  of  questions  will  serve  to  focus  attention  on  the  issue  of 
systematic  versus  random  variation: 

(1)  Are  the  conclusions  to  be  confined  to  the  things  actually  studied 
(the  sample  of  items,  or  animals),  or  to  the  immediate  sources  of  these 
things  (the  actual  machines,  or  lots,  or  herds  from  which  the  things 
studied  were  obtained) ;  or  are  they  to  be  expanded  to  apply  to  the  under- 
lying populations  (real  or  hypothetical  population  of  machines,  or  the 
production  process,  or  the  species)  ? 

(2)  In  complete  repetitions  of  the  experiment,  would  the  same 
things  be  studied  again  (the  same  items,  animals,  or  treatments) ;  would 
new  samples  be  drawn  from  the  identical  sources  (the  same  machines,  the 
same  lots,  or  the  same  herds) ;  or  would  new  samples  be  drawn  from  the 
underlying  populations  (samples  of  items  from  different  machines,  or 
animals  from  different  herds  of  the  same  breed)  ? 

5.2.  A  Typical  Analysis  of  Variance 

This  section  gives  a  typical  analysis-of -variance  table  for  cases  in  which 
the  random  deviations  involved  are  normally  distributed.  The  condi- 
tions are  specified  under  which  the  test  of  significance  customarily 
associated  with  this  table  can  be  regarded  as  a  test  of  the  hypothesis 
that  two  variances  a\  and  u\  are  equal,  relative  to  the  alternative  that 
0-2  =  0-2  _|_  K(j\,  where  K  is  a  known  constant  and  <j\  is  an  unknown 
variance.  Under  these  conditions  the  results  given  in  Sec.  4.1  can  be 
used  in  the  planning  and  interpretation  of  experiments  leading  to  analysis 
of  variance  tests  of  significance. 

Let  xn  denote  the  jth  observation  in  the  ?th  group  (i  =  1,  2,  •  •  •  , 
k;  j  =  1,  2,  •  -  •  ,  rii),  let  x{.  denote  the  average  of  the  n*  observations 
in  the  ^th  group  (i  =  1,  2,  •  •  •  ,  k),  and  let  x..  denote  the  average  of 
k 

all  N  =    >    rii  observations;  then  the  analysis-of- variance  table  takes 

t=i 
the  following  form: 
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Variation 

Degrees 

of 
freedom 

Sum  of 

squares 

Mean 

square 

Expecta- 
tion of 
mean 
square 

Between  group 
averages .... 

k  -  1 

si- 

k 

=  y  Ui  (xq 

t-1 

.  -x..)2 

S2b/(k  - 

1)  =  si 

o-2  +  fur] 

Within  groups. 

N  -  k 

Sw  z 

k       m 

■11 

i  =  1  j-  =  1 

(xn  -xi.y 

S2J(N  - 

-  k)  =  si 

<r2 

n  =  (N2  -   S   n\)/[N(k  -  1)]; 


when  rii  =  n2 


•  rik  =  n.n  =  n. 


If  the  x^  are  independently  and  normally  distributed  about  the 
respective  true  means  mi  of  the  groups  with  common  (unknown)  variance 
<t2,  and  if  the  groups  are  a  random  selection  from  a  population  of  groups 
such  that  in  repeated  selection  the  m»  would  be  normally  distributed 
about  an  over-all  population  mean  of  m  with  variance  a%,  the  xq  will  be 
normally  distributed  about  m  with  variance 


likewise 


(k  -  i)4 


(29) 
(30) 


(T2  +  nal 
will  be  distributed  as  x2  for  (k  —  1)   degrees  of  freedom;  and  finally 

<*■-,**  (31) 


will  be  distributed  independently  as  x2  for  (N  —  k)  degrees  of  freedom. 

A  comparison  of  (30)   and   (31)  with  the  assumptions  made  with 
regard  to  s\  and  s|  in  Sec.  4.1  reveals  that  if  we  set 


<t{  =  a2  +  nu\         and         v\  =  a2 

U\  =  (k  —  1)  and        n2  =  N  —  k 


(32) 
(33) 


s2  becomes  identified  with  sf  and  s2  with  si,  and  the  rule  of  procedure 
becomes 

(a)  Accept    a2  =  0   (there   exists   no   between-group   component   of 
random  variation)  whenever 


<Fa(k  -  \,N  -  k) 


(34) 
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(b)  Accept  a\  >  0  (there  exists  a  between-group  component  of  random 
variation)  whenever 


sb< 


>  Fa(k  -  1,  N  -  k) 


(35) 


Since  u\  cannot  be  negative,  the  probability  of  an  error  of  the  first 
kind  (concluding  that  <tJ  >  0  when  actually  a\  =  0)  is  exactly  a,  and 
the  probability  of  an  error  of  the  second  kind  (concluding  that  a]  =  0 
when  actually  u\  >  0)  is  0  when 


•GO 


1  +  n    ^ 


.«, 


k  -  1,N  -k) 


(36) 


If  there  are  n  observations  in  each  group,  n  =  n  and 

N  -  k  =  k(n  -  1); 


SO' 


*J  =  </>k  ft  fe  -  1,  k(n  -  1)] 


1 


(37) 


5.3.  Illustrations 

The  following  examples  illustrate  types  of  questions  to  which  answers 
may  be  obtained  with  the  aid  of  (37) : 

Example  VII.  It  is  suspected  that  differences  between  lots 
contribute  to  the  random  variation  of  a  particular  characteristic  of  a 
certain  product.  To  investigate  this  hypothesis,  n  items  are  to  be 
selected  at  random  and  measured,  from  each  of  k  lots  chosen  at 
random  from  the  lots  currently  accessible.  It  is  agreed  that  the  test 
of  significance  defined  by  (34)  and  (35)  is  to  be  conducted  at  the  0.05 
level  of  significance,  and  it  is  desired  to  have  a  probability  not 
greater  than  0.50  of  falsely  concluding  oj  =  0  if  tf/a2  =  0.10.  What 
combination  of  values  of  k  and  n  will  accomplish  this  with  a  minimum 
total  number  of  measurements? 

Baines  has  prepared  the  following  table  for  answering  such 
questions  when  a  =  0.05  and  0  =  0.50: 


k 

2 

3 

4 

5 

6 

7 

8 

9 

10 

14 

17 

21 

27 

n 

75 

34 

24 

19 

16 

14 

13 

12 

11 

9 

8 

7 

6 

kn 

150 

102 

96 

95 

96 

98 

104 

108 

110 

126 

136 

147 

162 

°V°2 

0.101 

0.101 

0.100 

0.100 

0.103 

0.104 

0.1010.100 

0.102 

0.099 

0.099 

0.100 

0.102 

It  is  seen  that  the  best  combination  is  19  items  from  each  of  5  lots, 
requiring  a  total  of  95  measurements.     However,  if  it  is  more  con- 

*  Baines,  Economical  Design,  tabulates  (37)  for  a  =  0.05,  /3  =  0.50,  k  =  2  to  31 
by  steps  of  1,  and  n  =  1  to  31  by  steps  of  1,  31  to  45  by  steps  of  2,  45  to  93  by  steps  of 
4,  101,  and  121. 


314 


TECHNIQUES  OF  STATISTICAL  ANALYSIS 


Chap.  8 


venient  to  use  4  lots  than  5  lots,  24  items  from  each  of  4  lots  will  be 
equally  effective  and  will  require  only  one  additional  measurement. 
Example  VIII.  If  it  were  agreed  initially  in  planning  a  test  of 
the  type  discussed  in  Example  VII  that  a  total  of  96  measurements 
would  be  employed,  then  the  problem  would  arise  of  determining 
the  combination  of  values  of  k  and  n,  subject  to  the  restriction 
kn  =  96,  that  minimizes  the  value  of  tf/a2  for  a  given  /3. 

Baines    gives    the    figures    tabulated    below,    which    apply   when 
a  =  0.05  and  B  =  0.50: 


k 

2 

3 

4 

6 

8 

12 

16 

24 

n 

48 

32 

24 

16 

12 

8 

6 

4 

<r2g/«2 

0.158 

0.107 

0.100 

0.103 

0.110 

0.126 

0.143 

0.178 

It  is  clear  that  the  combination  k  =  4,  n  =  24  provides  the  most 
stringent  test  having  a  =  0.05  and  kn  =  96,  in  the  sense  that  it 
yields  the  minimum  indifference  point. 

In  order  to  answer  readily  questions  of  the  type  illustrated  by  Exam- 
ples VII  and  VIII  when  a  and  /3  differ  from  0.05  and  0.50,  respectively, 
tables  of  (37)  should  be  prepared  using  k  and  n  as  arguments.  Indeed,  to 
facilitate  the  planning  of  experiments  leading  to  analysis-of-variance 
tests  corresponding  to  two-way  classifications  ("rows"  and  "columns  "),  to 
Latin  square  designs,  and  so  forth,  sets  of  such  tables  will  have  to  be 
prepared  for  each  type  of  design,  since  the  number  of  degrees  of  freedom 
corresponding  to  the  estimate  of  error  variance  s\  depends  upon  the 
design  of  the  experiment  [for  a  two-way  classification,  for  example, 
n2  =  (r  —  l)(c  —  1)},  as  does  also  the  number  of  degrees  of  freedom 
corresponding  to  the  estimate  of  variance  s\  to  be  tested.  Until  such 
tables  are  available,  these  questions  can  be  answered,  with  considerable 
effort  expended  on  interpolation,  by  means  of  Tables  8.3  and  8.4.  Fur- 
thermore, Tables  8.3  and  8.4  will  often  suffice  to  determine  a  few  points 
on  the  operating-characteristic  curve  of  a  test  of  significance  correspond- 
ing to  an  experimental  design  already  agreed  upon. 


6.  TECHNICAL  NOTES 
6.1-  Method  of  Construction  and  Accuracy  of  Tables  8.1  and  8.2 

As  explained  in  Sec.  2,  the  standard  procedure  for  determining 
whether  a  new  product  or  process  is  more  variable  than  a  standard 
product  or  process  is  to  compute  the  statistic  s\/al  and 

r2  <  1  whenever 


(a)  Accept  the  conclusion  <j\/v\ 


«i  <  Xa(ni) 


(38) 
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(b)  Accept  the  conclusion  <j\/<j\  >  1  whenever 


«?      XttOi) 

2    ^ 


rti 


(39) 


where  x«(^i)  is  "the  a  level  of  significance  of  x2  for  ^i  degrees  of  freedom. 
The  level  of  significance  a  represents  the  probability  of  accepting  the 
conclusion  a\/(j\  >  1  when  in  fact  a\/a\  =  1  (error  of  the  first  kind). 
Using  this  level  of  significance,  the  probability  of  accepting  the  con- 
clusion <j\/al  <  1  when  in  fact  <t\/(j\  >  1  (error  of  the  second  kind) 
depends  on  the  ratio  p  =  <j\/<t\  and  is  exactly  /3  if  p  has  the  value 


/        o         \  Xa(^l) 

p(a,/3,tti)  =     2 

Xi-s(^i) 


(40) 


The  proof  is  elementary.  Since  the  conclusion  <j\/<j\  <  1  will  be  reached 
whenever  (38)  occurs,  the  probability  of  this  conclusion,  given  a\  =  pal, 
is 


p^£l  <  x«(ni) 


2  2 

0"!   =  po-o 


^  i  — r  <  -2  Xi(ni) 


Xa(^l) 


(Tj    —    p(70 


(41) 


=  P     X2M  < 


=  1  -  P\  x2(nx)  > 


Xafal) 


From  (40), 


X«M 


Xi-flfai) 


so  the  last  form  of  (41)  becomes 


i  -  (i  -  $  =  u 


(42) 


(43) 


which  completes  the  proof.  * 

Thompson's  table  of  x«(^i)  was  used  in  constructing  Tables  8.1 
and  8.2.  Direct  substitution  of  these  values  in  formula  (40)  yields 
values  of  p(a,(3,n)  corresponding  to  a  =  0.05  and  0.01  and  to  /3  =  0.005, 
0.01,  0.025,  0.05,  0.10,  0.25,  0.50,  0.750,  0.900,  0.950,  0.975,  0.990,  and 
0.995  for  n  =  1  to  30  by  steps  of  1  and  30  to  100  by  steps  of  10.  The 
values  of  p(0.05,/3,n),  rounded  to  5  figures  or  4  decimals,  are  given  in 

*  Values  of  xlM  corresponding  to  a  =  0.01,  0.02,  0.05,  0.10,  0.20,  0.30,  0.50, 
0.70,  0.80,  0.90,  0.95,  0.98,  and  0.99  are  given  to  3  decimals  for  m  =  1  to  30  by  steps 
of  1  in  Table  III  of  Fisher,  Statistical  Methods  and  in  Table  IV  of  Fisher  and  Yates, 
Statistical  Tables,  the  latter  of  which  also  contains  the  values  for  a  =  0.001.  Values  of 
||(wi)  corresponding  to  a  =  0.005,  0.010,  0.025,  0.050,  0.100,  0.250,  0.500,  0.750, 
0.900,  0.950,  0.975,  0.990,  and  0.995  are  given  to  6  significant  figures  for  m  =  1  to 
30  by  steps  of  1  and  from  30  to  100  by  steps  of  10  by  Thompson,  "x2  distribution." 
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Table  8.1.  The  corresponding  values  of  p(0. 01, P,ri)  are  given  in  Table 
8.2.  The  values  of  p(a,fi,n)  for  n  =  120  were  computed  from  values  of 
Xa(120)  obtained  from  the  percentage  points  of  the  F  distribution  tabled 
by  Merrington  and  Thompson,  by  means  of  the  relation 

xl(n)  =  nFa(n,  co)  (44) 

Two-point  interpolation  in  Tables  8.1  and  8.2  in  terms  of  the  argument 
l/\/n  will  yield  about  2-decimal  accuracy  in  most  parts  of  these  tables.* 


6.2.  Method  of  Construction  and  Accuracy  of  Tables  8.3  and  8.4 

The  procedure  described  in  Sec.  4.1  for  determining  whether  a  particu- 
lar one  of  two  new  products  or  processes  is  more  variable  than  the 
other  is  as  follows:  Compute  the  statistic  s\/s\,  where  s\  and  sf  are 
sample  estimates  of  the  two  population  variances  such  that  in  repeated 
sampling  nis\/(j\  and  n2s\/o-\  will  be  distributed  independently  as  x2  for 
n\  and  n2  degrees  of  freedom,  respectively.  Base  the  choice  of  product 
or  process  on  the  rule 

(a)  Accept  product  I  whenever 

S4  <  Fa(nhn2)  (45) 

(b)  Accept  product  II  whenever 

^  >  Fa(ni,n2)  (46) 

S2 

where  Fa(ni,n2)  is  the  a  level  of  significance  of  the  variance  ratio  (F) 
distribution.  This  rule  is  so  formulated  that  the  probability  of  accepting 
product  II  when  in  fact  d\/a\  =  1  (error  of  the  first  kind)  equals  the  level 
of  significance  a.  The  probability  of  accepting  product  I  when  in  fact 
°i  >  al  (error  of  the  second  kind)  depends  on  the  ratio  <f>  =  a\/a\  and 
is  exactly  j3  when 

*<aA"*B»>  "  F^KS)  (47) 

=  Fa(ni,n2)  •  F?(n%tni) 

The  proof  is  similar  to  the  proof  of  (40).  Product  I  will  be  chosen 
whenever   (45)   occurs,   and  the  probability  of  this  when  a\/a\  =  <f>  is 

*  Interpolation  with  respect  to  a  and  (3  should  be  carried  out  in  terms  of  Ka  and 
Kp,  respectively,  where  Ke  is  the  unit  normal  deviate  exceeded  with  probability  e 
(tabled  in  Chap.  1,  Table  1.1).  For  an  explanation  and  examples  of  this  type  of 
interpolation,  see  Simaika,  Interpolation. 
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P  H  <Fa(nhm) 

S2 


S2 


.2 

Fa(n1}n2) 


=  P    F(ni,n.)  < 


=  1  -  P  \F(nhn2)  > 
=  1  -  (1  -  /3)  =  0 


0 

Fa(nhn2) 


when 


M^)  _  F  ,_,(„„„,)  (49) 

Tables  8.3  and  8.4  at  the  end  of  this  chapter  were  computed  from 
the  Merrington-Thompson  table  of  F  by  direct  substitution  in  formula 
(47).  These  tables  give  the  values  of  <j)(a,fi,ni,n2)  for  a  =  0.050  and 
a  =  0.010,  respectively,  corresponding  to  p  =  0.005,  0.010,  0.025,  0.050, 
0.100,  0.250,  0.500,  0.750,  0.900,  0.950,  0.975,  0.990,  0.995,  for  m  =  1  to  10 
by  steps  of  1,  and  12,  15,  20,  24,  30,  40,  60,  120,  oo,  and  n2  =  1  to  10  by 
steps  of  1,  and  12,  15,  20,  24,  30,  40,  60,  120,  qo  .  Interpolation  for  values 
of  4>(a,P,ni,n2)  corresponding  to  values  of  n\  and  n2  between  those  given  in 
these  tables  should  be  done  harmonically,  that  is,  in  terms  of  1/rii  and  \/n2. 
In  this  connection  it  should  be  noted  that  the  sequence  oo ,  120,  60,  40,  30, 
24,  20  corresponds  to  120/m  for  m  =  0,  1,  2,  3,  4;  5,  and  6,  respectively; 
and  that  the  sequence  °o;  60,  30,  20,  15,  12,  10  corresponds  to  60/r  for 
r  =  0,  1,  2,  3,  4,  5,  and  6,  respectively.  Accordingly,  interpolation  is 
most  conveniently  carried  out  in  terms  of  the  arguments  m  and  r.  Two- 
point  harmonic  interpolation  will  generally  give  0  to  sufficient  accuracy 
for  most  practical  purposes.  For  example,  by  2-point  harmonic 
interpolation 

,(0.05,0.05,30,24)  -  ^(0-05,0.05.30,20)  +  ,(0.05,0.05,30,30)  =  ^ 

(50) 

The  actual  value  is  3.660;  customary  2-point  linear  interpolation  gives 

*  The  most  accurate  table  of  Fa(ni,n2)  is  that  of  Merrington  and  Thompson,  which 
gives  the  values  to  5  significant  figures  corresponding  to  a  =  0.005,  0.010,  0.025, 
0.050,  0.100,  0.250,  and  0.500  for  nx  =  1  to  10  by  steps  of  1,  and  12, 15,  20,  24,  30, 40,  60, 
120,  w,  and  n2  =  1  to  30  by  steps  of  1,  and  40,  60,  120,  oo.  Values  of  Fa(nhn2), 
referred  to  as  e2z,  corresponding  to  a  —  0.001,  0.01,  0.05,  0.20,  are  given  (mostly  to 
2  decimals)  in  Table  V  of  Fisher  and  Yates,  Statistical  Tables  for  n\  =  1  to  6  by  steps 
of  1,  and  8,  12,  24,  oo .  and  n2  =  1  to  30  by  steps  of  1,  and  40,  60,  120,  oo . 
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3.719.     Similarly,  by  2-point  harmonic  interpolation 
^(0.05,0.05,30,120)  -  ^(0-05,0.05,30,60)  +  0(0.05,0.05,30,  cc )  =  %m 

(51) 
The  actual  value  is  2.617.     Use  may  be  made  of  the  fact  that 

0(a,j8,ni,oo)  =  p(a,j8,ni)  (52) 

to  obtain  values  of  the  left  side  of  this  equality  for  values  of  n\  included 
in  Tables  8.1  and  8.2  but  not  in  Tables  8.3  and  8.4. 
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CHAPTER  9 

UTILIZATION  OF  LIMITED  EXPERIMENTAL  FACILITIES 

WHEN  THE  COST  OF  EACH  MEASUREMENT  DEPENDS 

ON  ITS  MAGNITUDE 

1.  TYPE  OF  PROBLEM 

A  primary  objective  in  the  statistical  design  of  experiments  is  to 
maximize  the  amount  of  pertinent  information  obtained  from  a  given 
amount  of  time,  equipment,  personnel,  and  supplies,  or,  conversely,  to 
minimize  the  cost  of  obtaining  a  given  amount  of  information.  In  many 
experiments  the  cost  of  an  observation  can  be  regarded  as  a  constant; 
such  is  the  case,  for  example,  in  the  problem  of  Chap.  11.  Sometimes 
the  cost  of  an  observation  is  determined  by  the  independent  variables,  as, 
for  example,  when  an  independent  variable  is  the  amount  of  some  expen- 
sive substance  or  type  of  energy.  This  chapter  is  concerned  with  a  third 
situation,  in  which  the  cost  of  an  observation  depends  on  the  value  of 
the  dependent  variable ;  a  simple  case  occurs  when  the  dependent  variable 
is  determined  by  counting,  so  that  the  time  and  labor  involved  in  a 
measurement  are  proportional  to  its  magnitude.  Insofar  as  the  depend- 
ent and  independent  variables  are  correlated,  the  latter  two  situations 
are  essentially  the  same. 

An  interesting  case  in  which  the  cost  of  an  observation  depends  on 
its  magnitude,  one  which  illustrates  the  interplay  of  statistical  principles 
and  substantive  considerations  in  applied  statistics,  occurs  in  metallurgical 
experiments  to  determine  the  relation  between  time  to  rupture  and 
stress  for  a  given  alloy.  In  investigating  this  relation,  each  observation 
is  obtained  by  subjecting  a  test  specimen  to  a  predetermined  constant 
stress  and  recording  the  amount  of  time  that  elapses  before  the  specimen 
breaks.  The  number  of  observations  that  can  be  obtained  in  a  given 
period  of  time  from  a  single  piece  of  equipment  thus  depends  on  how  long 
it  takes  each  specimen  to  rupture,  which  in  turn  depends  largely  on  the 
stress  to  which  the  test  specimen  is  subjected. 

If  the  logarithm  of  time  to  rupture  is  a  linear  function  of  the  logarithm 
of  stress  (and  there  is  strong  evidence  that  this  is  approximately  so 
for  a  large  number  of  alloys  over  the  range  of  important  stresses),  tests 
at  any  two  stresses  suffice  to  estimate  the  line  relating  the  two  variables, 
and  from  this  line  can  be  predicted  the  time  to  rupture  at  any  other 
stress.  The  error  in  such  a  prediction  depends  on  the  number  of  tests 
from  which  the  line  is  estimated  and  on  the  stresses  at  which  they  are 
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made.  But  the  number  of  tests  made  itself  depends  on  the  stresses  at 
which  they  are  made,  since  the  lower  the  stress,  the  longer  it  takes  for  a 
specimen  to  break,  and  hence  the  fewer  the  tests  that  can  be  made  in  a 
limited  period  with  given  equipment.  This  factor,  taken  by  itself,  would 
argue  for  using  as  high  stresses  as  possible,  in  order  to  have  many  tests. 
However,  the  higher  the  stresses,  the  farther  it  is  necessary  to  extrapolate 
the  line  to  predict  time  to  rupture  at  the  stresses  which  are  practically 
important.  Further,  while  the  percentage  error  in  test  results  seems 
reasonably  independent  of  the  stress  used  for  a  fairly  wide  range  of 
stresses,  it  will  doubtless  increase  if  the  stress  is  raised  sufficiently. 
Thus  there  is  a  real  problem  about  the  best  stresses  at  which  to  test; 
there  are  opposing  considerations  that  have  to  be  evaluated  quantitatively 
to  get  a  definite  answer. 

2.  SPECIFIC  PROBLEM 

It  will  facilitate  the  solution  of  the  problem  to  state  it  more  formally, 
as  follows: 

Given:  (1)  The  relation  between  time  to  rupture  and  stress  is 

t  =  a  +  0s  (1) 

where  t  is  the  logarithm  of  time  to  rupture,  s  is  the  logarithm  of  the 
stress,  and  a  and  jS  are  constants  describing  the  "true"  (but  unknown) 
relationship. 

(2)  The  number  of  specimens  that  can  be  tested  is  limited  solely  by 
the  fact  that  there  are  a  limited  number  of  machines  available  and  a 
limited  amount  of  time. 

(3)  The  percentage  error  in  the  determination  of  time  to  rupture 
does  not  depend  on  stress;  that  is,  if  a  great  many  determinations  were 
made  at  each  of  a  number  of  stresses,  the  percentage  variation  in  the 
determinations  would  be  the  same  at  all  stresses.  This  assumption  will 
undoubtedly  break  down  at  sufficiently  high  stresses,  but  it  seems  to 
be  confirmed  by  the  available  evidence  for  stresses  ordinarily  used. 

Required:  (1)  The  stresses  at  which  to  make  the  tests,  and  (2)  the 
best  apportionment  of  the  tests  among  these  stresses,  in  order  to  estimate 
as  accurately  as  possible  (a)  the  slope  0  of  the  line  relating  t  to  s,  and  (b) 
the  time  to  rupture  at  some  predetermined  stress,  or  range  of  stresses, 
of  practical  importance,  ordinarily  considerably  below  the  stresses  at 
which  the  tests  are  conducted. 

3.  SOLUTION 

The  solution  of  this  problem  is  as  follows: 

(1)  It  is  always  better  to  use  two  testing  stresses  than  to  use  more 
than  two. 
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(2)  For  purpose  (b)  it  might  sometimes  be  best  to  use  only  one  stress 
/So  of  practical  importance.  However,  since  this  would  give  no  informa- 
tion about  the  time  to  rupture  at  any  other  stress,  it  would  ignore  purpose 
(a)  and  is  therefore  excluded. 

(3)  Let  Si  and  $2  be  the  two  best  testing  stresses  ($2  >  Si).  Then 
the  best  ratio  of  the  two  stresses,  say,  R  =  S1/S2,  is  given  by  solving 
the  following  equation  for  R : 

^logeR  =  1  +R~*  (2) 

This  is  the  best  ratio  both  for  estimating  the  slope  and  for  estimating 
the  time  to  rupture  at  So,  provided  So  is  less  than  Si,  the  case  of  primary 
interest.  *  The  best  ratio  of  the  two  testing  stresses  is  therefore  indepen- 
dent of  So  and  depends  only  on  ft.  The  value  of  ft  is,  of  course,  to  be 
determined  from  the  experiment;  but  some  advance  estimate  is  needed 
because  the  relative  times  required  for  tests  at  two  different  stresses 
depend  on  ft.  However,  the  advance  estimate  need  not  be  very  accurate, 
since  even  fairly  large  errors  in  the  estimate  of  ft  will  not  lead  to  any 
great  loss  in  efficiency. 

(4)  Under  the  given  assumptions,  the  absolute  values  of  Si  and  >S2 
cannot  be  determined.  So  long  as  assumption  (3)  of  Sec.  2  is  satisfied, 
namely,  that  the  percentage  error  is  independent  of  stress,  the  higher 
the  testing  stresses  the  better.  In  practice,  therefore,  $2  should  be 
chosen  as  the  highest  testing  stress  for  which  the  percentage  error  can 
reasonably  be  expected  to  be  no  larger  than  at  lower  stresses.  Si  can 
then  be  determined  from  the  R  given  by  equation  (2). 

(5)  Let  fti  be  the  number  of  tests  to  be  made  at  Si  and  n%  the  number 
to  be  made  at  Si.  Let  r  =  ni/n2.  The  ratio  r  is  all  that  needs  to  be 
determined,  since  the  total  number  of  tests,  Ui  +  n2,  is  determined  by 
$1  and  $2  and  by  the  time  and  equipment  available.  The  best  value  of  r 
depends  on  the  objective.  If  the  objective  is  to  estimate  ft  as  accurately 
as  possible,  the  best  value  of  r  is  given  by 

r  =  R    2  (3) 

If  the  objective  is  to  estimate  the  time  to  rupture  at  So  as  accurately 
as  possible,  the  best  value  of  r  is  given  by 

_e  log  ^T- 
r  =  R    2 *»  (4) 

i        £>o 

logs; 

*  See  Sec.  4.3,  especially  the  last  paragraph,  for  a  discussion  of  the  case  where 
$i  <  So. 
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The  best  value  of  r  in  this  case  depends  on  >S0.  This  means  that  it  is 
not  possible  to  design  the  experiment  so  as  to  get  the  best  estimate  of 
the  time  to  rupture  at  each  of  a  range  of  $o's.  It  is  necessary  to  com- 
promise among  the  objectives. 

(6)  The  following  numerical  calculation  illustrates  the  solution. 
For  one  particular  series  of  cast  alloys  tested  at  a  particular  elevated 
temperature,  /3  is  found  to  be  approximately  —8.  If  $2  is  taken  as  30,000 
pounds  per  square  inch  (p.s.i.),  the  best  value  of  Si  turns  out  to  be  22,000 
p.s.i.  The  best  apportionment  of  the  tests  between  the  two  stresses  is  as 
follows: 


Objective  is  to  estimate  as 
accurately  as  possible 


(a)  Slope 

(6)  Time  to  rupture  at  stress  of 
5,000  p.s.i 

10,000  p.s.i 

15,000  p.s.i 

20,000  p.s.i 


Number  of  tests  to 

be  made  at  22,000 

p.s.i.  for  each  test 

at  30,000  p.s.i. 


Percentage  gain  in  precision 

over  using  0.50  tests  at 

25,000  p.s.i.  for  every  test 

at  30,000  p.s.i. 


39 

42 
45 
47 
55 


The  last  column  gives  the  gain  in  precision,  which  is  equivalent  to  the 
savings  in  testing  for  a  given  degree  of  precision,  over  an  alternative 
combination  of  stresses  and  relative  number  of  observations,  which 
differs  only  moderately  from  the  optimum  combination;  it  indicates  the 
value  of  careful  planning  of  the  stresses  at  which  to  test  and  of  the  relative 
number  of  tests  at  different  stresses. 


4.  MATHEMATICAL  DEVELOPMENT 


4.1.  Notation 


Let  T  =  time  to  rupture 

t  =  logarithm  of  time  to  rupture 
S  =  stress 

#o  =  stress  for  which  it  is  desired  to  estimate  time  to  rupture 
$i  =  smaller  of  two  testing  stresses  to  be  used 
£2  =  larger  of  two  testing  stresses  to  be  used 
n\  =  number  of  tests  to  be  made  at  Si 
n2  =  number  of  tests  to  be  made  at  S2 
n  =  rii  +  n2 
R  =■  S1/S2 
r  =  n1/n2 
a,  j8  =  parameters  of  line  relating  log  T  to  log  S,  so  that 


log  T  =  a  +  j9  log  S 


(5) 
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a,  b  =  estimates  of  a  and  |8 
to  =  estimate  of  logarithm  of  time  to  rupture  at  stress  So 
K  =  total  amount  of  machine  time  available 
a  =  standard    deviation    of    observations    about    the    line    (5), 

assumed  the  same  for  all  S 
Gb  =  standard  deviation  of  estimated  slope 

<jt  =  standard  deviation  of  logarithm  of  estimated  time  to  rupture 
For  convenience,  take  all  logarithms  to  be  natural  logarithms. 
We  assume  that  the  parameters  of  the  line  are  estimated  from  the  data 
by  the  usual  least-squares  procedure  applied  to  the  logarithms  of   the 
variables.     Then,  by  algebraic  manipulation  of  the  usual  formulas,* 

«l  =  7 —2  (6) 


< 


n.(log|;)    +'*(1°g|) 

/,    sX 

»I»S  I  log  J  I 


(7) 


The  problem  is  to  minimize  (6)  and  (7)  by  proper  choice  of  nh  n2,  Si, 
and  *S2,  subject  to  the  constraint  that  a  fixed  amount  of  machine  time  is 
used.     Since  T  =  eaS^,  this  constraint  is 

ni<?«£?  +  ntf*S%  =  K  (8) 

or 

niS?  +  n2S^2  =  Ke~«  =  k  (9) 

It  will  clarify  the  solution  and  yield  useful  supplementary  results  to 

proceed  to  the  final  solution  in  stages,  rather  than  at  one  step.     We  shall, 

therefore,  first  assume  Si  and  $2  given  and  determine  the  optimum  value 

of  r;  thereafter  we  shall  proceed  to  determine  the  optimum  value  of  R. 

4.2.  Objective  to  Get  Best  Estimate  of  Slope 

The  problem  is  to  maximize 


{logsJ  (10) 


of       ni  +  n2 

subject  to  (9),  since  this  will  minimize  the  variance  of  the  slope  as  given 
by  (6). 

Replacing  ni  by  rn2  in  (9)  and  (10),  solving  (9)  so  modified  for  n2,  and 
substituting  the  result  for  n2  in  (10)  gives 

kr  (log—1) 

*  See  Fisher,  Statistical  Methods,  Sec.  26,  pp.  133,  135.     (For  full  citation,  see 
Sec.  5.) 
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to  be  maximized  with  respect  to  r,  without  restraint  and  with  Si  and  S2 
assumed  fixed.  Setting  the  derivative  with  respect  to  r  equal  to  zero 
gives 

dU)       ./,      &Y         CSg  -  r'Sj) 

JT.   =  "  V0g  Sj    (1  +  r).(rflf  +  W  =  °  (12) 

A  solution  to  (12)  is  obtained  by  setting 

£?  -  r2£?  =  0  (13) 


or 

22    2  (14) 


-  (i)"1  - 


where  we  take  the  positive  square  root,  as  r  is  essentially  positive.  A 
check  on  the  second  derivative  shows  that  this  solution  does  maximize 
(11).     Substituting  (14)  into  (11)  and  taking  the  reciprocal  gives 

4  _  (iSi5  +  s^y  (is) 


K)' 


as  the  minimum  ratio  of  the  variance  of  the  estimated  slope  to  the 
variance  of  an  individual  observation,  given  Si  and  aS2- 
Replacing  Si  by  RS2  in  (15)  gives 

0-2       k(\ogRy  l   ; 

Consider  $2  as  fixed  and  minimize  (16)  with  respect  to  R;  this  gives 


'GO 


2^  (jg  +  1) 
dR  kR(\og  Ry 

whence 


£  R2  log  7?  -  R2  -  1  I    =0  (17) 


].. 


I  log  Jf2  =  i+iri  (is) 

A  check  on  the  second  derivative  shows  that  this  solution  does  minimize 
(16). 

Substituting  the  value  for  log  R  from  (18)  in  (16)  gives 


Sec.  4.3 
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as  the  minimum  value  of  (16),  given  S%.  Since  (3  is  negative,  it  is  clear 
that  the  larger  S2,  the  smaller  the  variance;  hence  there  is  no  finite 
optimum  value  of  S2.  In  practice,  increases  in  S2  beyond  some  point  will 
increase  a2,  and  the  optimum  value  of  S2  will  be  somewhere  near  this 
point.* 

4.3.  Objective  to  Get  Best  Estimate  of  Time  to  Rupture  at  a  Given  Stress 

The  problem  here  is  to  minimize  (7)  subject  to  (9).  Replacing  ri\  by 
rn2  in  (7)  and  (9),  solving  (9)  so  modified  for  n2,  and  substituting  the 
result  in  (7)  gives 


faj$  +  fa$]**  +  l* 


kr 


hi)' 


(20) 


to  be  minimized  with  respect  to  r,  without  restraint  and  with  Si  and  $? 
assumed  fixed.     Setting  the  derivative  of  (20)  with  respect  to  r  equal  to 


zero  gives 


0) 


dr 


■(*fj 


kr 


«{«.£)■ -*(*£) 


0     (21) 


From  (21)  the  minimizing  solution  is  given  by 


(22) 


or 


0,      So 
-log 


±  \sj  .     £0 

gs~ 


(23) 


The  positive  sign  is  required  to  make  r  positive  if  So  is  less  than  Si  or 
greater  than  S»;  the  negative  sign,  if  Si  <  So  <  $2.  The  second  deriva- 
tive shows  that  these  signs  also  give  minima.  Since  we  are  concerned 
primarily  with  values  of  So  that  are  below  Si,  we  shall  hereafter  consider 
only  the  positive  sign. 

*  It  will  pay  to  go  somewhat  beyond  the  point  at  which  <r2  starts  to  increase,  since 
the  loss  from  the  increase  in  <r2  will  for  a  time  be  less  than  the  gain  from  the  increase 
in  Si. 
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Substituting  (23)  into  (20)  gives  as  the  minimum  ratio  of  the  variance 
of  the  estimated  rupture  time  to  the  variance  of  an  individual  observa- 
tion, with  Si  and  Si  fixed, 


0  Of  0  o    "I! 

Vlog^  +  S/loggJ 


K)" 


(24) 


On  replacing  Si  by  RS%,  (24)  becomes 


■i 


§1 

k 


(1    +  i?2)  log 


logi? 


(25) 


from  which,  holding  $2  fixed, 


($)_ 


2£? 


dft 


(1  +  #)  loi 


£0 
S, 


logR 


g^log^-l-^\log|' 


#(log#)2 


=  0 


(26) 


For  aSo  less  than  Si,  there  is  no  value  of  R  that  will  make  the  first  bracket 
equal  to  zero.     Accordingly,  the  minimizing  value  of  R  is  given  by 

0  P 


0 


or 


R2  log  R  -  1  -  #2 


log  72  =  1  +  R 


0 


(27) 


(28) 


which  is  the  same  as  (18),  the  solution  for  minimizing  the  variance  of 
the  slope.  This  is,  of  course,  the  minimizing  value  only  if  #2  is  sufficiently 
large  so  that  the  derived  Si  is  larger  than  S0. 

By  differentiating  (25)  with  respect  to  $2,  it  can  be  shown  that  (25) 
decreases  as  #2  increases,  so  that  there  is  no  finite  optimum  value  of  $2. 
,  If  $2  is  chosen  greater  than  So,  while  Si  derived  from  the  R  satisfying 
(28)  is  less  than  So,  (28)  no  longer  gives  the  optimum  solution.  In  that 
case,  it  turns  out  that  the  variance  is  a  monotonically  decreasing  function 
of  R.  The  best  that  can  be  done  is  to  take  the  largest  possible  value  of 
R,  namely,  R  =  S0/S2,  or  Si  =  So.  This  solution  using  two  testing 
stresses  will  not,  however,  be  as  good  as  that  putting  all  the  tests  at  $0. 
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This  chapter  was  prepared  by  the  editors. 

With  the  exception  of  the  charts  and  related  material,  it  is  based  on 
a  memorandum  by  J.  Wolfowitz,  Columbia  University 


CHAPTER  10 

EXPERIMENTS  IN  WHICH  THE  INDEPENDENT  VARIABLE 

IS  KNOWN  ONLY  IN  TERMS  OF  A 

PROBABILITY  DISTRIBUTION 

1.  PROBLEM 

1.1.  Type  of  Problem 

This  chapter  deals  with  a  type  of  problem  in  which  it  is  desired  to 
subject  each  of  a  number  of  items  to  a  test  of  at  least  a  given  severity, 
but  the  severity  of  any  one  test  cannot  be  measured  and  cannot  be  con- 
trolled precisely.  Thus  it  is  necessary  to  make  an  independent  study  of 
the  probability  that  a  test  will  attain  the  specified  severity  and  then  to 
determine  the  number  of  tests  necessary  for  each  item  in  order  to  attain 
a  given  probability  that  at  least  one  test  will  be  of  the  specified  severity. 
The  particular  problem  discussed  is  relatively  simple,  and  the  chapter  is 
intended  more  to  suggest  an  interesting  and  in  some  respects  important 
type  of  problem  and  to  indicate  an  approach  to  it,  than  to  present  a 
solution  or  a  novel  application  of  statistical  techniques. 

1.2.  Specific  Problem 

A  gun  barrel  is  considered  fit  for  service  with  respect  to  its  pressure- 
resisting  properties  if  it  has  fired  a  round  such  that  the  gas  pressure 
developed  is  not  less  than  a  prescribed  pressure  K,  K  being  greater  than 
any  pressure  that  may  reasonably  occur  in  service.  Special  ammunition 
that  develops  high  pressures  is  used  for  proofing  the  barrels.  In  order  to 
measure  the  pressure  developed  by  a  round,  a  hole  must  be  bored  in  the 
barrel,  which  renders  it  unfit  for  service.  Consequently,  it  is  impossible 
to  ascertain  with  certainty  that  a  barrel  intended  for  service  has  actually 
been  proofed.  It  is,  however,  possible  to  draw  conclusions  about  the 
percentage  of  barrels  proofed  from  a  study  of  the  characteristics  of  the 
special  ammunition.  When  a  lot  of  the  latter  is  delivered,  a  small  fixed 
sample  of  size  n  is  tested  for  pressure  and  the  remainder  (which  is  almost 
the  whole  lot)  is  used  for  proofing,  N  rounds  being  fired  in  each  barrel 
to  be  proofed.  Manufacture  of  the  special  ammunition  is  in  a  sufficiently 
satisfactory  state  that  the  proportion  of  defective  rounds,  that  is,  those 
which  generate  a  pressure  less  than  K,  is  usually  very  small  (of  the  order 
of  0.001).  The  problem  is,  on  the  basis  of  the  results  from  the  sample  of 
n,  (1)  to  set  up  criteria  for  accepting  or  rejecting  the  lot,  and  (2)  to 
determine  N,  the  number  of  rounds  to  be  fired  in  each  barrel, 
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•3  *  -3  6  1  8  9  10 

Observed  proportion  ( x/n  ) 

Fig.  10.1. — Ninety-nine  percent  confidence  belts  for  population  proportion. 
2.  SOLUTION 
Choose  y,  a  confidence  coefficient  with  which  to  operate,  the  minimum 
number  of  barrels  that  an  acceptable  lot  should  proof,  and  the  minimum 
tolerable  probability  of  proofing  a  barrel.  From  the  sample  of  n  obtain 
an  estimate  py  with  confidence  coefficient  y  of  an  upper  bound  on  the 
proportion  of  rounds  in  the  lot  which,  when  fired,  will  generate  a  pres- 
sure less  than  K.  How  this  estimate  is  to  be  obtained  is  described  in 
Sec.  3.     Construct  a  table  as  follows: 


Number  of  rounds  (N) 

fired  in  each  barrel 

1 

2 


With  confidence  coefficient  y, 

the  probability  that  a  barrel 

will  be  proofed  is  at  least 

1     -Py 

1  -p\ 


From  the  table  select  the  smallest  N  yielding  the  required  probability  of 
proofing  a  barrel.     If  this  N  is  too  large  to  enable  the  lot  of  ammunition 
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Observed  proportion  (  x/n  ) 

Fig.  10.2. — 'Ninety-five  percent  confidence  belts  for  population  proportion. 

to  proof  the  minimum  required  number  of  barrels,  reject  the  lot.     Other- 
wise, fire  N  rounds  in  each  barrel. 

3.  CONFIDENCE  LIMITS  FOR  PROPORTION  DEFECTIVE 
3.1.  Attributes 

If  the  upper  confidence  limit  py  is  determined  by  attributes,  that  is,  by 
classifying  each  round  in  the  sample  of  n  as  above  K  or  below  K,  it  is 
unnecessary  to  make  any  assumption  whatever  about  the  distribution 
of  the  pressures  of  the  bullets  of  the  lot.  It  is  only  necessary  that  the 
sample  of  n  be  selected  at  random.  If  it  is  believed  that  defective 
rounds  do  not  occur  independently  (for  example,  if  a  machine  functions 
improperly,  all  the  bullets  it  produces  may  be  defective  and  these  may  be 
adjacent  in  the  lot),  the  TV  bullets  to  be  fired  from  any  gun  barrel  should 
be  chosen  at  random  from  the  lot. 

Perhaps  the  most  accurate  convenient  method  of  finding  py  is  with 
the  aid  of  Thompson's  table  of  percentage  points  of  the  incomplete 
beta  function.     Let 
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Fig.  10.3. — Ninety  percent  confidence  belts  for  population  proportion. 

c  =  number  of  rounds  in  the  sample  of  n 

whose  pressures  are  less  than  K  (1) 

a  =  100(1  -  7)  (2) 

vi  =  2(c  +  1)  (3) 

v2  =  2(n  -  c)  (4) 

Enter  the  table  with  v\  and  v2  on  the  a  percentage-point  page  and  find  x. 
Then  py  =  1  —  x.  For  example,  if  7  =  0.975,  c  =  0,  n  =  30,  we  have 
a  =  2.5,  v\  =  2,  v2  =  60;  from  the  table  we  find  x  =  0.88430,  or  #0.975 
=  0.1157.     The  table  of  Sec.  2  then  appears  as  follows: 

Minimum 


AT 

probability 

1 

0.8843 

2 

0.9866 

3 

0.9985 

etc. 
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Population  proportion  (P) 
1. 


—--■"■ —  ^-'^-?=§R 

^-^"V^g^lS 

**■'       „"*^^5§S3£ 

^       ^'X,'^^m%yJ- 

on                                                                                                        ^               SS&AWAW 

y        *'1*'1*' ^6ffi$fc£j4 

y          S          SSSy&Zpl/_;t 

y      S      ^  z.  Sy y^zwyt  ti 

y        y  y  ^7\y  yzz^vv  ytt 

pn                                                                                S               SSSSJ^ZZtX/ -/  -^T 

Z'         s  *  yS  Jzz&?£2.  1  Ll 

S            S    £    7 7 77744^Z    7    J-JL 

z               SS    * '-/■ '    &Z&77-    7    v    7 

££        *  s  ^  +  zzvzzzz/  y  y  '  ., 

yn                                           -^              /LS-2-    £    £/./z£77jT _f    Z    J      1 

f               Z    ^    7       2    Z    /?^    Z^    7       ^    7         T 

/       y]/  y  / /\//a// //  7  /  /      > 

2           ^  ^^  t :  Z7/7Z7 '7_y  7-  y    Z         1 

fio                   ^7          ,7    7rt^  Zizzzzz/zy   /-  y  z        y 

y         y  ^  y^/zzzzzzzy  /   '  /        i 

y        7   z  y^yMzzzzzyy  yy  y      y 

y        x  7  /  /*/////  /  /  /        / 

7                                y'          /            .7         y       ////$%$//       7          /             /          ,'                                   y 

7         f~  z  _y  7  zzz^i&-§y ~?  >    z  v          7 

7                 >^    7     7^^    7    7^77/Ty'i  ^7*     >       7     >                   / 

^         +    z  y  /yzzzzz/y\y/  /         / 

-/■       7   ^  v^  yyzyziyz^^x/  s       s 

40    2_       ,/  7  x  /  /7/7//Z  /  Xy<y-  7          7 

40  /        /  /   /  /////// zy  /  yiy        y 

h       z  ,/_/  yzzzzzzzi^  fy  J^        7 

i     /  y  z  yzzzzzyy  y y y       y 

*nt   -a  -X  z  yzzyzy/zy  yyy      z&* 

■zot  _/  z  y/zzzXy/yy  7  y       Z- 

2  /  7  yzzzczy y y  yy      s 

y J y yz/zzzz7 y y  z  y       ?                     ± 

%  t? y zzz&zyls y s  y      y                       ± 

?0  L7  z  zyyzzzzy y y y ~,y    ^y                 y    y 

iy+  Z7z/y?y  y yyy      s                          y 

tjy  yyAZysiy/s  y      y                            y 

yzyiimyyy  y    y                             ± 

ly^z^z^zyyy^       y                                      y 

10  fzzztzzyyyyy    ^s 

10  ±/iW2zyysy    ^                          y 

ftzwyy^y  ^ 

TAwyyy^  ^ 

%wyy^^^ 

nVzt^^^ 

.10 


.20 


.30 


.70  .80  .90  1.00 

Fig.  10.4. — Eighty  percent  confidence  belts  for  population  proportion. 


.40  .50  .60 

Observed  proportion  (Vn) 


Note:  When  c  =  0,  py  =  1  -  (1  -  y)*. 

When  7  =  0.995,  0.975,  0.95,  or  0.90,  py  may  be  found  from  Figs. 
10.1,  10.2,  10.3,  or  10.4.*     These  show  belts  within  which  we  may  assert, 

*  Figures  10.1  and  10.2  are  reproduced  from  Pearson  and  Clopper,  "  Confidence 
Limits,"  by  permission  of  Biometrika.  Figures  10.3  and  10.4  are  based  on  computa- 
tions from  Thompson's  "Incomplete  Beta  Function"  made  (in  cooperation  with  the 
Statistical  Research  Group)  by  R.  L.  Deckinger  of  the  Eureau  of  Ordnance,  U.S.  Navy, 
and  furnished  through  the  courtesy  of  R.  H.  Park.     (For  full  citations,  see  Sec.  5.) 

With  regard  to  Figs.  10.1  and  10.2,  Professor  E.  S.  Pearson  writes  in  part: 
"Actually  I  believe  they  are  not  completely  accurate,  as  the  levels  at  certain  points 
were  obtained  by  interpolation  and  not  by  direct  calculation  from  the  'Incomplete 
Beta  Function.'  If  we  reproduce  such  diagrams  ourselves,  we  shall  have  to  look  into 
this  point."     (Letter  of  January  16,  1946,  to  one  of  the  editors.) 

For  Figs.  10.3  and  10.4,  abscissas  were  computed  for  ordinates  between  0  and  1  by 
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with  confidence  coefficient  of,  say,  V,  that  the  true  proportion  lies  when 
the  observed  proportion  is  x/n.  For  the  present  purposes  7  is  related 
to  T  by 

r  =  27  -  1  (5) 

Thus,  for  the  example  of  the  preceding  paragraph  (where  7  =  0.975), 
r  =  0.95.  From  Fig.  10.2  we  see  that  the  upper  curve  for  n  =  30  cuts 
the  axis  x/n  =  0  at  about  0.11  or  0.12. 

Still  another  method  of  finding  py  is  by  means  of  the  arcsin  trans- 
formation described  in  Chap.  16.     Thus 


•       J  .  IC        .         K1-y\ 

py  =  sin2  I  arcsm  +  — ~  ) 


(6) 


where  i£i_7  is  the  normal  deviate  exceeded  with  probability  1  —  7  (see 
Table  1.1  of  Chap.  1).  This  formula  is  not  satisfactory,  however,  when 
c/n  is  very  small,  especially  not  when  c  =  0. 

It  should  be  noted  that  the  foregoing  assumes  the  lot  to  be  large 
enough  relative  to  the  sample  so  that  the  effect  of  lot  size  can  be  ignored; 
this  will  almost  always  be  the  case  in  practice. 

3.2.  Variables 

If  it  can  be  assumed  that  the  gas  pressures  of  the  bullets  in  the  lot  are 
random  independent  observations  on  a  normally  distributed  variate,  py  can 
be  determined  by  the  method  described  in  Chap.  1,  Sec.  8.2  (pp.  42-45).* 
The  value  of  V  given  by  equation  (5)  of  this  chapter  is  used  as  the  value 
of  7  in  Chap.  1;  in  other  words,  Lp  of  Chap.  1  corresponds  with  1  —  7  of 
this  chapter. 

4.  SIZE  OF  PRELIMINARY  SAMPLE 
While  the  rounds  expended  in  testing  the  initial  sample  of  n  do  not 
serve  directly  to  proof  machine-gun  barrels,  it  would  be  an  error  to 

steps  of  0.1,  using  Thompson's  "Incomplete  Beta  Function."  For  a  confidence 
coefficient  r  (in  Fig.  10.3,  r  =  0.90;  in  Fig.  10.4,  r  =  0.80),  the  upper  (lower)  abscissa 
for  a  given  value  of  n  represents  the  value  of  x/n  that  will  be  exceeded  (fallen  short  of) 
with  probability  (1  —  r)/2  in  samples  of  size  n  from  a  binomial  population  with 
parameter  P.  A  smooth  curve  was  passed  through  the  11  points,  but  is  shown  on 
the  chart  only  over  the  range  1/n'  <  x/n  <  1  —  1/n',  where  n'  is  the  next  sample 
size  above  n  shown  on  the  chart.  Outside  this  range  the  curve  was  bent  so  that  at 
abscissas  0  and  1  the  ordinates  are  1  —  (1  —  r)1/n  and  (1  —  T)1/n,  respectively, 
instead  of  1  -  [(1  -  T)/2]1/n  and  [(1  -  T)/2]1/n.  The  reason  for  the  difference  in 
computation  at  x/n  =  0  and  1  is  that  there  the  confidence  interval  must  lie  entirely 
to  one  side  of  the  observed  proportion.  In  deciding  where  to  bend  the  curve,  it  was 
assumed  that  a  given  curve  will  never  be  used  in  the  intervals  0  <  x/n  <  1/n'  and 
1  —  1/n'  <  x  <  1,  and  that  only  for  interpolation  will  it  ever  be  used  in  the  intervals 
1/n'  <  x/n  <  1/n  and  1  -  1/n  <  x/n  <  1  -  1/n'. 

*A  slightly  more  accurate  method  is  given  in  Chap.  1,  Sec.  11.4.6,  Problem  II, 
where  e  corresponds  with  1  —  7  of  this  chapter  (see  pp.  76-77). 
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characterize  them  as  wasted  and  to  try  to  keep  n  as  small  as  possible. 
Actually  these  rounds  supply  information  about  the  quality  of  the  ammu- 
nition, and  more  information  can  easily  result  in  an  actual  saving  of 
ammunition.  Suppose,  for  example,  that  a  lot  contains  10,000  rounds,  of 
which  (unknown  to  us)  10  (0.1  percent)  are  defective,  that  we  wish  to 
operate  with  a  0.99  confidence  coefficient,  and  that  the  minimum  tolerable 
probability  of  proofing  a  barrel  is  to  be  0.99.  Suppose  further  that  in  a 
sample  of  size  20  (n  =  20)  no  defectives  appear  (c  =  0).  Proceeding  as 
in  Sec.  3.1,  we  construct  the  following  table: 

n  =  20  t  =  0.99  c  =    0 

a  =  0.01  vi"=  2  P2  =  40 

x  =  0.79433         00.99  =  0.20567 

Minimum 

N       probability 

1  0.79 

2  0  96 

3  0.99  + 

Three  rounds  are  needed  for  each  barrel,  so  the  total  number  of  barrels 
proofed  by  the  lot  will  be 

10-0003  -  20  =  3,326  (7) 

Now,  suppose  that  a  sample  of  60  {n  =  60)  had  been  taken  and  no 
defectives  found.     Then 

n  =  60  y  =  0.99  c  =       0 

a  =  0.01  vi  =  2  v,  =  120 

x  =  0.92612         po.99  =  0.07388 

Minimum 

iV      probability 

1  0.93 

2  0.99  + 

Two  rounds  suffice  for  each  barrel,  and  the  total  number  of  barrels  proofed 
by  the  lot  would  be 

10'0002-60  =  4,970  (8) 

a  49  percent  increase  over  the  previous  result. 

Of  course  the  number  of  defectives  in  the  sample  of  n  is  a  chance 
variable  whose  distribution  depends  on  n,  among  other  parameters.  On 
the  basis  of  previous  experience  with  the  ammunition  being  used,  it  is 
possible  to  determine  that  value  of  n  which  is  optimum  in  the  sense  that 
if  the  quality  of  the  ammunition  is  essentially  unchanged  with  time,  the 
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average   number   of  machine-gun  barrels   successfully   proofed  will  be 
maximized. 
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CHAPTER  11 

PLANNING  AN  EXPERIMENT  FOR  ESTIMATING  THE  MEAN 
AND  STANDARD  DEVIATION  OF  A  NORMAL  DISTRIBUTION 
FROM  OBSERVATIONS  ON  THE  CUMULATIVE  DISTRIBUTION 

1.  THE  PROBLEM 

A  class  of  statistical  problems  of  frequent  occurrence  arises  from  situa- 
tions in  which  a  characteristic  is  conceived  to  be  normally  distributed 
among  the  individuals  of  a  population,  but  measurement  of  any  one 
individual  can  show  only  whether  the  characteristic  is  above  or  below  a 
certain  level.  The  most  familiar  form  of  this  problem  concerns  the 
lethality  of  varying  doses  of  drugs.  The  electric  impulse  necessary  to 
ignite  a  charge  of  powder  in  a  certain  system  represents  another  such 
problem;  for,  although  in  this  case  it  is  possible  to  apply  increasing 
impulses  until  ignition  occurs,  there  is  some  evidence  that  an  impulse 
that  fails  to  cause  ignition  may  alter  the  sensitivity  of  the  system.  In 
such  situations  a  single  stimulus  of  known  intensity  is  applied  to  each 
individual.  The  resulting  data  show  the  proportions  of  the  individuals 
tested  at  various  intensities  whose  sensitivity  fell  below  those  intensities; 
in  other  words,  the  data  for  each  level  of  stimulus  intensity  provide  one 
observation  on  the  cumulated  distribution  of  sensitivities.  These  obser- 
vations may  be  analyzed  to  estimate  the  mean  and  standard  deviation 
of  the  sensitivity  of  the  individuals  comprising  the  population. 

The  present  chapter,  is  concerned  with  certain  aspects  of  the  planning 
of  experiments  that  yield  observations  on  the  cumulated  distribution 
and  are  analyzed  to  determine  the  mean  and  standard  deviation  of  the 
distribution.  For  concreteness,  the  discussion  is  given  in  terms  of  the 
specific  problem  of  planning  an  experiment  to  study  the  sensitivity  of  a 
radio  proximity  fuze.  A  radio  proximity  fuze,  as  its  name  suggests, 
makes  use  of  radio  waves  to  explode  a  projectile  when  and  if  it  comes  into 
proximity  with  a  target;  unlike  time  fuzes  or  point-detonating  fuzes,  it 
requires  neither  that  a  definite  time  of  flight  be  set  in  advance  nor  that 
the  target  be  hit  directly.  If  proximity  fuzes  of  a  given  model  behaved 
identically  there  would,  for  fixed  conditions,  be  a  definite  distance  from 
the  target  within  which  all  fuzes  would  operate  and  beyond  which  none 
would  operate.  Actually,  there  are  random  variations  in  the  fuzes  and 
in  the  impulses  returned  by  the  target ;  so  if  a  number  of  fuzes  pass  exactly 
the  same  distance  from  a  target  under  the  same  conditions,  some  operate 

341 


342  TECHNIQUES  OF  STATISTICAL  ANALYSIS  Chap.  11 

and  some  do  not.  An  important  parameter  of  fuze  performance  is, 
therefore,  the  probability  that  a  fuze  will  operate  if  it  passes  within  a 
certain  distance  of  the  target. 

2.  METHOD  OF  ANALYSIS 

It  has  been  determined  from  a  certain  set  of  1,211  test  firings  made 
under  relatively  homogeneous  conditions  that  the  probability  that  a 
nondefective  fuze  will  operate  if  it  passes  within  a  certain  distance  of  the 
target  can  be  well  represented  by  assuming  (a)  that  each  nondefective 
fuze  is  characterized  by  a  definite  sensitivity,  or  maximum  distance  at 
which  it  will  function,  and  (6)  that  these  sensitivities  are  normally 
distributed  among  fuzes. 

Let  >S  be  the  sensitivity  of  a  fuze,  defined  as  the  maximum  distance  at 
which    that   fuze    fired   under   the    given    circumstances   will 
function 
\x  be  the  true  (population)  arithmetic  mean  sensitivity 
a  be  the  true  (population)  standard  deviation  of  sensitivity 
Then  under  the  above  assumptions  the  probability  that  a  fuze  will  have 
a  sensitivity  between  S  and  S  +  dS  is 


!  (S-m)2 


a  y/%c 


2'2    dS  (1) 


The  proportion  T(d)  of  fuzes  passing  a  given  distance  d  from  the  target 
that  operate  is  then  the  integral  of  (1)  from  d  to  <»,  since  every  fuze 
whose  sensitivity  is  greater  than  d  will  operate  at  d,  while  every  fuze 
whose  sensitivity  is  less  will  not.     Therefore, 


T(d)  =  -  -^=.  /      e      2"2    dS  (2) 


V2v 


In  firing  trials,  it  is  not  possible  to  observe  the  sensitivity  of  a  fuze. 
Each  fuze  can  be  fired  only  once;  all  that  can  be  observed  is  the  minimum 
distance  from  the  target  and  whether  the  fuze  operated  in  proximity  to 
the  target.*  The  observations  obtained  are  therefore  observations  on 
T(d),  the  integral  of  the  normal  curve,  and  not  directly  on  S. 

Methods  for  analyzing  such  data  have  been  developed,  primarily  in 
connection  with  dosage-mortality  curves,  f     These  methods  involve  writ- 

*  Actually,  the  entire  trajectory  of  the  fuze  can  be  observed  and  also  the  actual 
point  where  the  fuze  operated.  However,  as  these  play  no  role  in  the  problem  con- 
sidered here,  they  are  ignored. 

f  See  Bliss,  "The  Calculation  of  the  Dosage-mortality  Curve,"  "The  Comparison 
of  Dosage-mortality  Data,"  "Estimating  the  Dosage-mortality  Curve,"  and  "The 
Calculation  of  the  Time  Mortality  Curve";  also  Fisher  and  Yates,  Tables,  pp.  8-12. 
(For  full  citations,  see  Sec.  6.) 
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ing  (2)  as 


where 


T^  =  ^Tjye~X~2dX  (3) 

Y  =  ±^*  (4) 


An  observed  value  of  T{d)  can  be  converted  into  an  estimate  of  Y  by 
means  of  tables  of  the  normal  probability  integral.*  The  estimate  of  Y 
may  be  denoted  by  y.     Now,  Y  is  linearly  related  to  d  by 

Y  =  -^  +  -d  (5) 

a        a 

Hence  the  observed  data  may  be  analyzed  by  fitting  a  straight  line  to 
the  observed  values  (y,d).  The  usual  direct  methods  of  fitting  a  straight 
line  cannot  be  used,  however,  since  the  variance  of  y  is  not  a  constant  but 
depends  on  Y;  instead,  a  method  of  successive  approximations  is  used. 
Tables  are  available  to  facilitate  fitting  the  straight  line  by  this  method,  f 
To  avoid  negative  values,  some  of  these  tables  use  "probits,"  defined  as 
normal  deviates  increased  by  5,  instead  of  the  normal  deviates  themselves. 
The  function  (5)  was  fitted  to  the  data  observed  in  the  1,211  test 
firings  on  fuze  performance  mentioned  previously.  The  estimates  of  /-i 
and  a  turned  out  to  be  A  and  B,  respectively .  {  Figure  11.1  shows  the 
fitted  function  (5),  as  well  as  90  percent  confidence  limits  on  it.  These 
confidence  limits  were  computed  from  the  equation  § 

(6) 


-^—  +  (d<  - 

-dy^~  l 

~ 

i  =  l 

\  Wi(di  - 

i  =  \ 

-  dy 

where  N  =  number  of  observed  points  to  which  the  straight  line  was 
fitted 
u>i  =  weight  given  to  the  ith.  point  in  getting  the  last  approximation 
d  =  weighted  average  of  observed  values  of  d,  that  is, 


i  =  l 


*  For  example,  Table  1.5  in  Chap.  1  of  this  volume. 

f  For  example,  Fisher  and  Yates,  Tables,  p.  49.  Another  useful  set  of  tables  is 
contained  in  Brown,  Tables  to  Facilitate  the  Analysis  of  Sensitivity  Data.  This  shows 
the  ordinate  Z  of  a  normal  curve,  the  probability  P  that  a  given  normal  deviate  will 
be  exceeded,  Q  =  1  —  P,  Z/PQ,  and  Z2/PQ,  all  to  5  decimal  places  for  values  of  Y 
from  0  to  3  by  steps  of  0.01. 

t  The  numerical  values  of  A  and  B,  which  cannot  be  published,  are  immaterial 
for  the  present  purposes. 

§  See  Bliss,  "The  Calculation  of  the  Dosage-mortality  Curve." 
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?2  =  estimated  variance  of  an  observation  about  the  last  approxi- 
mation, that  is,  if  Yi  is  the  regression  value  for  the  ith  point 
computed  from  the  last  approximation,  then 


JV 


2,    WiiVi 


YiY 


,2    _    t==1 


N 


(7) 


The  problem  then  arose  of  collecting  similar  data  under  different 
conditions.  Theoretical  considerations  gave  little  indication  of  the  way 
in  which  the  performance  of  the  fuze  would  be  affected  by  the  difference 


Normal  deviate 
4 


Observed  value  (d) 
Fig.  11.1. — Fitted  function  with  90  percent  confidence  limits  based  on  1211  observations. 

in  conditions.  The  scale  of  the  experiment  had  to  be  limited  to  at  most 
250  rounds.  The  statistical  problem,  with  which  the  rest  of  this  chapter 
deals,  is  that  of  distributing  these  rounds  among  various  possible  distances 
from  the  target  in  such  a  way  as  to  get  as  accurate  an  estimate  as  possible 
of  T(d).     This  problem  serves  to  illustrate  a  number  of  considerations 
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pertinent  to  such  experiments;  while  no  general,  theoretical  solution  is 
offered,  the  mode  of  approach  should  be  at  least  suggestive  in  similar 
situations. 

3.  STATISTICAL  CONSIDERATIONS  AFFECTING  THE  PLANNING 
OF  THE  EXPERIMENT 

A  paradox  frequently  arises  in  planning  experiments.  The  most  effi- 
cient design  depends  on  the  true  value  of  the  parameter  under  investiga- 
tion, but  if  this  were  known  there  would  be  no  need  for  the  experiment. 
This  paradox  is  usually  evaded  by  basing  the  experimental  design  on  some 
advance  estimate  of  the  parameters  and  trying  to  adjust  the  design  so 
that  the  efficiency  of  the  experiment  is  not  much  affected  by  minor 
differences  between  what  is  true  and  what  is  assumed.  This  method 
itself  raises  difficulties,  for  example,  how  much  of  the  efficiency  that  the 
optimum  design  would  have  if  the  advance  estimate  were  correct  should 
be  sacrificed  to  gain  efficiency  if  the  advance  estimate  is  not  correct? 

This  difficulty  is  present  in  marked  degree  in  the  problem  under 
consideration  and  is  aggravated  by  another  difficulty,  namely,  that  two 
parameters  are  to  be  estimated  and  the  experimental  design  that  is  best 
for  estimating  one  is  not  best  for  estimating  the  other.  The  two  param- 
eters to  be  estimated  are  \i  and  a,  or,  what  is  equivalent,  the  slope  and 
intercept  of  (5).  The  experiment  would  be  most  efficient  if  it  could 
minimize  both  the  variance  of  the  estimate  of  \i  and  that  of  the  estimate 
of  a;  unfortunately,  the  distribution  of  observations  that  minimizes  the 
variance  of  the  estimate  of  /x  is  decidedly  different  from  that  which 
minimizes  the  variance  of  the  estimate  of  <x. 

If  a  were  known,  the  variance  of  the  estimate  of  \i  would  be  minimized 
by  taking  all  observations  at  a  single  distance  equal  to  i±.  But  since  a  is 
not  known,  it  would  be  self-defeating  to  take  all  observations  at  a  single 
distance,  because  (a)  this  would  make  it  impossible  to  correct  the  advance 
estimate  of  /jl,  and  (6)  it  would  give  no  information  at  all  about  a.  If  the 
observed  value  of  T(d)  were  50  percent,  the  estimate  of  \i  from  the  experi- 
ment would  be  equal  to  the  value  of  d  at  which  the  firings  were  made, 
that  is,  it  would  be  the  same  as  the  advance  estimate.  If  the  observed 
value  of  T(d)  were  other  than  50  percent,  the  experiment  would  indicate 
that  the  advance  estimate  of  ju  was  wrong  but  no  new  estimate  could  be 
made.  Only  one  point  on  the  straight  line  (5)  would  be  available,  so  its 
slope  could  not  be  estimated;  hence,  it  would  be  impossible  to  estimate  the 
value  of  d  at  which  T(d)  =  50  percent.  These  statements  are  equivalent, 
of  course,  to  saying  that  no  estimate  of  a  would  be  possible  either.  The 
most  that  could  be  learned  from  an  experiment  with  all  observations  at  an 
estimated  jit  would  be  whether  this  estimate  was  correct. 

The  variance  of  the  estimate  of  a  would  be  minimized  by  taking  all 
observations  at  two  values  of  d,  approximately  1.6  times  the  estimated 
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value  of  a  on  either  side  of  the  estimated  value  of  /*.  Even  though  this 
would  not  give  as  accurate  an  estimate  of  \x  as  could  be  obtained  by 
placing  the  observations  closer  to  /x,  it  is  a  workable  arrangement  in  that 
it  would  yield  estimates  of  both  \i  and  a  even  though  the  advance  esti- 
mates were  in  error. 

From  these  considerations  alone,  the  appropriate  compromise  would 
appear  to  be  to  place  the  observations  in  two  groups  located  symmetri- 
cally about  the  estimated  n  but  closer  to  it  than  the  distance  that  would 
minimize  the  variance  of  the  estimate  of  a.  To  decide  how  much  closer 
would  require  deciding  how  much  it  is  worth  sacrificing  accuracy  in  a  to 
gain  accuracy  in  /x. 

Other  considerations  must,  however,  be  taken  into  account.  Placing 
all  the  observations  at  two  points  puts  great  reliance  on  the  validity  of 
equation  (2)  as  a  description  of  T(d).  This  relation  was  derived  from  a 
single  body  of  data,  for  which  its  validity  cannot  be  considered  conclu- 
sively established;  it  may,  therefore,  not  be  valid  for  the  new  experiment. 
Observations  at  only  two  values  of  d  cannot,  however,  show  whether  it  is 
valid  or  not,  since  there  will  be  only  two  points  to  which  to  fit  (5),  and  a 
straight  line  can  always  be  fitted  perfectly  to  two  points.  In  order  to 
determine  whether  (2),  and  hence  (5),  describe  the  data  reasonably  well, 
it  is  necessary  to  spread  the  observations  among  more  values  of  d.  The 
desirability  of  spreading  the  observations  was  reinforced  in  the  present 
experiment  by  the  fact  that  the  data  were  to  be  used  for  other  purposes, 
in  particular,  to  study  the  location  of  points  of  functioning,  for  which  a 
wide  spread  seemed  desirable. 

Other  considerations  to  be  taken  into  account  in  the  particular  experi- 
ment were  (1)  it  was  not  feasible  to  use  distances  greater  than  about 
A  +  %B ;  (2)  most  of  the  firings  in  the  earlier  experiment  had  been  at 
short  distances,  so  that  it  seemed  desirable  to  have  some  firings  in  the  new 
experiment  at  short  distances  in  order  to  make  a  direct  comparison  of  the 
results  of  the  two  experiments,  independent  of  any  assumptions  about  the 
form  of  T(d). 

4.  POSSIBLE  DESIGNS 

In  the  light  of  the  preceding  considerations,  five  alternative  distribu- 
tions of  250  observations  were  considered.  These  are  shown  in  Table 
11.1. 

Plans  1  and  2  are  suggested  by  considering  how  to  minimize  the 
variance  of  the  estimate  of  a.  The  advance  estimates,  based  on  the 
previous  experiment,  were  A  for  \i  and  B  for  a.  The  250  observations  in 
Plans  1  and  2  are  divided  into  two  groups  and  placed  symmetrically  on 
either  side  of  the  mean.  In  Plan  2,  they  are  about  as  far  apart  as  is 
required  to  minimize  the  estimate  of  o-;  in  Plan  1,  they  are  brought  closer 
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together  to  get  a  more  accurate  estimate  of  /x.  Plan  3  spreads  the  obser- 
vation evenly  over  the  five  classes  spanned  by  Plan  2;  it  is  symmetrical, 
and  its  largest  distance  is  the  largest  that  is  feasible.  Plans  4  and  5  are 
attempts  to  get  some  data  for  the  short  distances  for  comparison  with  the 
previous  experiment.  In  Plan  4,  the  observations  for  the  low  distances 
are  obtained  by  omitting  those  at  A,  since  if  A  is  the  true  mean,  such 
firings  give  no  information  about  <r.  In  Plan  5,  the  observations  for  the 
low  distances  are  taken  equally  from  the  five  groups  in  Plan  3. 

To  measure  the  relative  efficiency  of  these  five  possible  designs,  90  per- 
cent confidence  limits  on  (5)  were  computed  for  each,  assuming  that  the 
correct  straight  line  was  that  obtained  in  the  earlier  experiment, 


Y  =  -  -  4-  —d 
1  B^  B 


(8) 


The  confidence  limits  for  Plans  1  and  2  are  plotted  in  Fig.  11.2,  together 
with  the  confidence  limits  obtained  from  the  earlier  experiment  (shown 
also  in  Fig.  11.1).*  Table  11.2  shows  these  confidence  limits  and  the 
assumed  correct  values  in  somewhat  different  form,  namely,  in  terms  of 
100  T(d),  the  percentage  of  nondefective  fuzes. 

TABLE  11.1 
Alternative  Distributions  of  250  Observations 


Distance 

Number  of  rounds  to  be  fired  under  plan 

1 

2 

3 

4 

5 

A  -  *%B 

A  -  *%B 

A  -  i%B 

A  -%B 

A 

A  +%B 

A  +  i%B 

125 
125 

125 
125 

50 
50 
50 
50 
50 

20 
30 
50 
50 

50 
50 

20 
30 
40 
40 
40 
40 
40 

Two  points  are  fairly  clear  from  Fig.  11.2  and  Table  11.2  (1)  Any 
of  the  five  plans  for  250  observations  gives  as  much  information  as,  if  not 
somewhat  more  than,  the  1,211  observations  used  in  the  earlier  experi- 
ment; (2)  the  five  possible  plans  do  not  differ  substantially  in  efficiency. 
Plan  1,  which  emphasizes  most  strongly  the  accuracy  of  the  estimate  of  /x, 
gives  therefore  the  narrowest  confidence  limits  for  distances  near  the 
mean.  Plan  2,  which  emphasizes  most  the  accuracy  of  the  estimate  of  a, 
gives  the  narrowest  limits  for  distances  far  from  the  mean  but  the  widest 

*  The  confidence  limits  for  Plans  3,  4,  and  5  are  omitted  from  Fig.  11.2  only 
because  they  are  so  close  to  those  for  Plans  1  and  2  that  if  all  five  were  plotted  on  a 
figure  the  size  of  Fig.  11.2,  it  would  be  impossible  to  distinguish  the  five  lines. 
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limits  for  distances  near  the  mean.     The  other  plans  are  in  general 
intermediate. 

On  the  basis  of  this  evidence,  it  was  decided  to  recommend  Plan  5. 
This  plan  covers  the  largest  number  of  values  of  d  and  therefore  comes 
closest  to  satisfying  objectives  other  than  the  minimization  of  the 
variance  of  estimates  of  /x  and  a;  at  the  same  time,  it  does  not  sacrifice 
much  efficiency  in  the  estimation  of  n  and  of  a. 

Normal  deviate 
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Observed  value  (d) 
Fig.    11.2. — Ninety   percent   confidence  limits  on   assumed  function   for   two   experi- 
mental designs  utilizing  250  observations,  and  for  preliminary  experiment  based  on  1211 
observations. 


5.  TECHNICAL  NOTES* 
The  problem  of  determining  the  optimum  distribution  of  observations 
for  estimating  /x  and  a  is  considerably  simplified  by  the  following  two 
requirements,  which  seem  plausible  and  will  be  accepted  here  without 
rigorous  proof: 

*  This  section  presumes  a  knowledge  of  the  methods  of  analyzing  probit  data  (see 
References,  Sec.  6). 
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(1)  The  symmetry  of  the  normal  distribution  implies  that  the  opti- 
mum distribution  of  observations  will  be  symmetrical  about  the  true 
mean  /jl.  Note  that  replacing  d  —  /j.  by  its  negative  will  have  no  effect 
on  variances,  since  these  involve  only  (a)  the  weights,  which  depend  only 
on  the  absolute  value  of  (d  —  ij.)/<t,  and  (b)  the  squares  of  quantities  like 
d  —  ix. 

(2)  Since  the  problem  is  one  of  fitting  a  straight  line  containing  two 
parameters,  the  optimum  distribution  will  involve  placing  all  the  obser- 
vations at  two  values  of  d.  These  two  values  of  d  enter  into  the  variances 
in  the  form  of  linear  combinations  of  weighted  squared  deviations.  If 
one  pair  of  values,  d\  and  d2,  such  that  d\  —  n  =  —  (d2  —  /*),  contributes 
more  to  the  minimization  of  the  variance  than  another  pair  of  values,  say 
dz  and  d±,  such  that  d$  —  /x  =  —  (d4  —  /x),  it  will  continue  to  do  so  no 
matter  how  many  observations  are  shifted  from  d3  and  d4  to  di  and  d2. 

These  two  assumptions  reduce  the  problem  to  one  of  determining  a 
pair  of  values  d\  and  d2  such  that  d\  —  \i  =  —  (d2  —  n) ,  &t  each  of  which 
half  the  observations  are  to  be  placed. 
Let  (i  =  population  mean 

a  =  population  standard  deviation 
m  =  estimate  of  \x 
8  =  estimate  of  a 

Yt  = d-^  a  =  i,  2) 

yi  =  — =  estimate  of  F< 

o 

i    r  °°  -%i 

Ti  =       /      e    2  dx  =  true  proportion  above  di 

V  2tt  J  Yi 

U  =  estimate  of  Ti 
Observed  values  of  U  give,  from  tables  of  probits  or  of  the  normal  dis- 
tribution, estimates  of  yi}  and  hence  two  equations 

Vi  =  ^  0) 

V,  =  ^  (10) 

o 


Solving  for  m  and  s  gives 


m  =  Vld*  -  V2dl  (11) 

2/i  ~  2/2 


di  —  d 


It  is  known*  that 


2/i  -  2/2 


(12) 


<  -  T-^p 


where 

*  See  Bliss,  "The  Calculation  of  the  Dosage-mortality  Curve.' 
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rii  =  number  of  observations  at  di 
1        -^ 

To  a  first  approximation, 

'•  =  &)U<  +  teX-,<  <15» 

The  partial  derivatives  in  (14)  and  (15)  may  be  evaluated  as  follows: 
dm       y2(di  —  d2) 


dyi  (2/1  -  y-2)2 

dm  =_  —yijdi  —  d2) 
dyi  (yi  -  y*)2 

ds     _  —(di  —  d2) 

dyi  (2/i  -  2/2)2 

ds  di  —  d2 

dy*  ~  (2/i  -  2/2)2 
Hence 


(16) 

(17) 
(18) 
(19) 


al  = 


Yi(d!  -  d2y  T1{1  -  7\)    ,    Ff(di  -  d2y  T2(l  -  T2) 


+    ,v         vW  ^^^       (20) 


(Fi  -  F2)4         n^2  '     (Fi  -  Y2y        n2z 

2  _  (di  -  d2y  ri(i  -  rj  |   (di  -  ^2)2  y,(i  -  t2) 

*s       (Y1-  Y2y        mzl      '  "*"  (Fx  -  F2)4        ^     "  lZi; 
Substituting  for  Fi  and  F2  in  (20)  and  (21)  and  simplifying  gives 

s       (di  -  d2)2  L       **!                   nrf       J  {  6) 

So  far,  no  use  has  been  made  of  the  symmetry  requirement.  Equa- 
tions (22)  and  (23)  are  therefore  valid  for  any  pair  of  values  of  d\  and  d2. 
Symmetry  implies  that 

n\  =  n2  (24) 

(di-/*)  =  ~(d2  -  n)  (25) 

Tl  =  1  -  T2  (26) 

z\  =  ^2  (27) 

From  (25), 

d1-  d2  =  2(di  -  /*)  (28) 

Substituting  into  (22)  and  (23)  gives 
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(72  7\(1  -  TO 
2         n&\ 

TiCl  -  Ti) 

a2    !Ti(l  -  TO 
27?        n&\ 

(29) 
(30) 

2(di  -  m)2        ^z2i 

<j^  will  be  a  minimum  if  !Fi(l  —  Ti)/z\  is  a  minimum  with  respect  to 
d\  or,  what  is  the  same  thing,  with  respect  to  Fr.  But  Ti(l  —  Ti)/z\  is  a 
minimum  at  Y\  =  0,  as  can  be  demonstrated  by  differentiating  or,  more 
simply,  by  a  glance  at  the  tabled  values  of  z\/T i(l  —  Ti).*  Hence,  to 
minimize  the  variance  of  m,  with  a  assumed  known,  it  would  be  necessary 
to  put  all  the  observations  at  p.. 

o-2  will  be  a  minimum  if  Ti(l  —  Ti)/zl Y\  is  a  minimum.  Differentiat- 
ing yields  an  equation  that  would  have  to  be  solved  numerically.  Since, 
as  noted  above,  z\/[T\(l  —  T\)]  is  tabled,  it  is  simpler  to  compute  directly 
Y\z\/[T\{\  —  Ti)]  and  determine  numerically  the  value  of  Yi  at  which  it 
reaches  a  maximum.  This  gives  approximately  1.574  or  1.575  as  the 
value  of  Y  for  which  cr2,  is  a  minimum,  f  Hence,  the  variance  of  s  will  be 
minimized  by  putting  half  the  observations  at  d\  —  \x  +  1.57o-,  and  half 
at  d2  =  m  ~  1.57o-. 
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CHAPTER  12 
ABANDONING  AN  EXPERIMENT  PRIOR  TO  COMPLETION 

1.  INTRODUCTION 

Every  experiment  is  designed  in  the  light  of  certain  presumptions 
about  the  underlying  phenomena  to  be  studied,  if  only  the  presumption 
that  the  phenomena  warrant  study.  It  is  not  always  possible,  however, 
to  confirm  these  presumptions  in  advance.  In  such  cases  it  may  be 
desirable  to  introduce  a  sequential  element  into  the  design  of  the  experi- 
ment so  that  the  experimental  program  can  be  adjusted  to  any  facts 
about  the  basic  assumptions  disclosed  during  the  course  of  the  investiga- 
tion. The  present  chapter  considers  a  simple  design  of  this  type,  where 
an  experiment  is  to  be  undertaken  to  estimate  a  parameter  of  a  continu- 
ous distribution,  but  provision  is  to  be  made  to  abandon  the  experiment 
prior  to  completion  if  the  accumulated  evidence  at  any  stage  reveals  that 
the  underlying  population  does  not  warrant  study. 

1.1.  Nature  of  the  Problem 

The  circumstances  giving  rise  to  the  need  of  an  experiment  that 
provides  for  possible  abandonment  are  these :  A  new  mechanism  has  been 
constructed  whose  performance,  in  functioning  models,  is  thought  to  be 
superior  to  that  of  similar  standard  mechanisms.  It  is  accordingly 
desired  to  determine  the  average  and  variability  of  a  particular  perform- 
ance characteristic  from  an  experiment  of  sufficient  size  to  give  reliable 
estimates  of  these  parameters.  There  is  reason  to  fear,  however,  that  the 
proportion  of  breakdowns  may  be  prohibitively  high,  in  which  case  the 
mechanism  cannot  be  used  in  its  present  state  of  development  and 
the  estimation  of  its  performance  parameters  is  of  little  value.  More- 
over, because  the  experiment  will  be  costly  in  time,  personnel,  and  equip- 
ment, considerable  saving  will  result  from  discontinuing  the  experiment 
at  an  early  stage  if  the  mechanism  exhibits  too  high  a  failure  rate.  An 
experiment  is  therefore  desired  that  incorporates  the  following  provisions : 
(1)  If  the  failure  rate  exceeds  a  certain  tolerance  standard,  there  must 
be  a  preassigned  degree  of  assurance  of  rejecting  the  mechanism  and 
terminating  the  experiment;  (2)  if  the  failure  rate  is  below  a  certain 
acceptable  standard,  there  must  be  a  preassigned  degree  of  assurance  of 
completing  the  experiment;.  (3)  the  experiment  should  cease  at  the 
earliest  point  at  which  the  accumulated  evidence  is  sufficient  to  reject 
the  mechanism  with  the  required  degree  of  assurance. 

355 
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1.2.  Given  Features  of  the  Experiment 

Previous  investigation  of  the  mechanism  appears  to  justify  the 
assumption  that  the  performance  characteristic  under  study  is  normally 
distributed  about  some  unknown  average  value.  Moreover,  from  pro- 
visional figures  it  appears  that  a  satisfactory  balance  between  the  cost  of 
the  experiment  and  the  precision  of  the  estimates  can  be  obtained  by  a 
2X2X2X2  factorial  experiment  involving  duplicate  observations, 
that  is,  from  32  observations.  Certain  details  of  the  experiment  can  thus 
be  specified  at  once.  If  the  mechanism  has  an  acceptable  failure  rate,  no 
fewer  than  32  items  can  be  tested,  since  32  measurements  of  performance 
are  required;  on  the  other  hand,  no  more  than  32  functioning  items  need 
be  examined,  since  these  will  yield  the  required  number  of  measurements 
for  estimation  with  the  minimum  amount  of  testing.  Thus  the  criterion 
for  accepting  the  mechanism  can  be  represented  graphically  as  a  straight 
line  by  plotting  the  number  of  items  that  fail  against  the  number  of 
items  tested.  This  acceptance  line  intersects  the  base  line  at  32,  showing 
that  no  fewer  than  32  items  can  be  tested;  it  has  a  slope  of  1,  showing 
that  if  failures  occur,  the  number  of  items  tested  shall  be  only  sufficient 
to  yield  32  performance  measurements.  As  the  experiment  proceeds,  a 
point  can  be  plotted  after  each  test  representing  the  cumulated  number 
of  failures  and  the  number  of  tests  made;  when  such  a  point  reaches  th3 
acceptance  line  the  experiment  is  complete,  for  32  performance  measure- 
ments have  been  obtained. 

1.3.  Statistical  Problem 

If  several  hundred  trials  are  made  without  obtaining  the  32  perform- 
ance measurements  needed  to  complete  the  experiment,  it  is  obvious 
that  the  failure  rate  is  excessive  and  the  experiment  ought  to  be  aban- 
doned. This  suggests  the  advisability  of  introducing  a  rejection  line, 
showing  for  each  number  of  items  tested  the  number  of  failures  that 
would  justify  rejecting  the  mechanism  and  abandoning  the  experiment; 
the  rejection  line  should  intersect  the  vertical  axis  above  0  and  have  a 
slope  less  than  1.  The  two  lines  together  yield  a  closed  region  such  that, 
if  the  point  representing  the  number  of  failures  and  the  number  of  items 
tested  crosses  the  upper  line,  the  mechanism  is  rejected  and  the  experi- 
ment abandoned;  while  if  the  point  crosses  the  lower  line,  the  mechanism 
is  accepted  and  estimates  are  made  from  the  resulting  32  measurements. 

Since  the  lower  line  is  fixed  by  the  conditions  of  the  problem,  the 
statistical  problem  that  remains  is  clearly  to  determine  the  upper  line  in 
such  a  way  as  to  meet  preassigned  requirements  on  the  probabilities  of 
abandonment  and  completion,  with  a  minimum  average  amount  of  test- 
ing.    No  general  solution  to  this  problem  has  been  found.     The  following 
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discussion  will  show,  however,  that  sequential  principles*  can  be  used  to 
give  a  good  approximation  to  the  solution. 

To  formulate  the  problem  more  precisely,  we  assume  that  a  proportion 
of  failures  p2  exists  that  is  considered  unacceptable;  that  is,  if  the  failure 
rate  equals  or  exceeds  p2,  we  wish  to  reject  the  mechanism.  If  the  failure 
rate  is  less  than  p2,  however,  we  are  willing  to  accept  the  mechanism;  and 
in  particular,  if  it  is  less  than  some  value  ph  we  wish  a  high  degree  of 
assurance  of  accepting  the  mechanism. 

Number  of 
failures  (d) 


Abandon 

experiment 

Experiment 

s  ■/ 

complete 

32 


Number  tested  (N) 


Fig.  12.1. 


2.  PROPOSED  SOLUTION 
2.1.  Approximating  the  Rejection  Line  by  Sequential  Principles 

The  use  of  sequential  principles  to  meet  these  requirements  is  sug- 
gested by  the  desire  to  minimize  the  necessary  amount  of  sampling.  A 
sequential  criterion  makes  it  possible  to  reappraise  the  failure  rate  of  the 
mechanism  after  each  test  and  to  determine  on  the  basis  of  the  accumu- 
lated evidence  whether  (a)  to  reject  the  mechanism,  (6)  to  accept  it,  or 
(c)  to  continue  the  experiment.  As  soon  as  the  accumulated  evidence 
is  sufficient  for  decision  (a)  or  (6),  the  experiment  stops  and  further, 
unnecessary  experimenting  is  avoided.  The  usual  sequential  test,  how- 
ever, consists  of  both  an  acceptance  criterion  and  a  rejection  criterion, 
whereas  completion  of  the  test  under  consideration  requires  only  a 
rejection  criterion.  What  we  seek,  therefore,  is  a  sequential  test  which 
makes  no  provision  for  acceptance  but  which  achieves  certain  assigned 
probabilities  of  rejection  for  the  specified  failure  rates  pi  and  p2-  Having 
found  such  a  test,  we  can  provide  for  actual  acceptance  by  combining 

*  See  Chap.  17  for  a  brief  account  of  sequential  analysis. 
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our  sequential  rejection  criterion  with  the  predetermined  acceptance 
criterion,  taking  care  to  allow  for  the  effect  of  this  combination  on  the 
total  probabilities  of  rejection. 

The  desired  sequential  rejection  criterion  can  be  derived  as  a  special 
case  of  the  ordinary  sequential  binomial  test.  The  ordinary  test  can  be 
represented  on  a  chart  similar  to  Fig.  12.1  by  a  pair  of  parallel  straight 
lines  having  a  slope  s  and  intercepts  —hi  and  h2.  A  sequential  test  which 
provides  only  for  rejection,  that  is,  which  consists  solely  of  the  upper 
decision  line,  may  be  regarded  as  a  limiting  case  of  an  ordinary  sequential 
binomial  test  with  hi  =  °° .  This  conception  enables  us  to  use  the  for- 
mulas for  the  usual  two-decision  test  to  obtain  the  operating-character- 
istic (OC)  curve  of  a  one-decision  sequential  test,  that  is,  the  probability 
of  acceptance  Lp  as  a  function  of  the  true  population  proportion  of 
failures  p.     When  fti  =   °o,  the  OC  curve  is  given  by* 

s 

R  ~^  —  1 
p  =  — - — ; when  p  <  s  (1) 


R»    h> 


and  by 


Rp  =  1  — Lp  =  1  when  p  >  s  (2) 


Thus,  the  probability  of  rejecting  is  1  if  p  >  s;  it  lies  between  0  and  1  if 
p  <  s.  Accordingly,  we  take  the  value  p2,  representing  the  failure  rate 
above  which  we  want  to  abandon  the  experiment,  as  the  slope  s  of  the 
one-decision  sequential  test.  Then  if  a  is  the  risk  we  are  willing  to  run 
of  abandoning  the  experiment  and  rejecting  the  mechanism  when  the 
true  failure  rate  is  only  ph  we  may  substitute  pi  for  p,  a  for  Rp,  and  p2 
for  s  in  equation  (1)  and  solve  for  /i2.f 

2.2.  Adjusting  the  Rejection  Line  to  Take  Account  of  the  Acceptance  Line 

Having  determined  the  one-decision  sequential  test  that  yields  the 
requisite  probabilities  of  rejection,  we  now  take  into  account  the  accept- 
ance criterion  resulting  from  the  conditions  of  the  experiment.  This 
acceptance  criterion  affects  the  probabilities  in  two  respects.  (1)  The 
probabilities  of  rejection  will  be  decreased,  since  those  samples  which 

*  See  Statistical  Research  Group,  Sequential  Analysis,  Appendix  B,  equations 
(B.80)  and  (B.81).     (For  full  citations,  see  Sec.  5.) 

f  An  approximation  to  /i2  can  be  obtained  by  replacing  p-2  of  the  formula 


log  - 


hi  = 


&pi  \l  -  ViJ 
(Sequential  Analysis,  Sec.  2,  equation  2.204)  by  p2  +  (P2  —  pi)  =  2p2  —  pi. 
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would  lead  to  late  rejection  will  not  be  rejected  at  all  (in  fact,  will  now  be 
accepted);  (2)  for  every  value  of  p  <  1,  whether  or  not  p  >  s,  the 
probability  of  acceptance  will  be  greater  than  zero.  The  net  effect  of 
the  acceptance  line  is  thus  to  increase  the  chances  of  acceptance  for  all 
values  of  p  <  1.  This  means  that  Lp^  (=  1  —  RPJ  will  now  be  positive; 
while  Rp  will  be  less  than  a.  The  decrease  in  Rp  is,  of  course,  desirable, 
but  it  can  be  attained  only  at  the  cost  of  additional  testing  beyond  that 
necessary  to  maintain  Rp  at  the  required  level  a.  It  remains  therefore 
to  readjust  the  decision  lines  so  that  Rp  will  be  a  and  Lp  will  be  at  an 
acceptably  low  level,  say  (3. 

Since  the  acceptance  line  is  fixed  by  the  conditions  of  the  problem,  the 
necessary  changes  in  the  decision  lines  must  be  made  entirely  in  the 
rejection  line.  It  is  apparent  that  the  probabilities  of  rejection  can  be 
increased  either  by  reducing  h2  or  by  reducing  s,  or  both.  It  is  not  clear, 
however,  what  combination  of  adjustments  will  give  the  desired  risks 
a  and  (3  with  a  minimum  amount  of  inspection.  We  therefore  resort  to 
trial  and  error  to  find  a  combination  of  h2  and  s  that  gives  the  desired 
probabilities  of  acceptance,  leaving  unsettled  the  question  whether  this 
combination  of  h2  and  s  is  the  best  possible  from  the  standpoint  of  the 
average  amount  of  testing.  Fortunately,  this  trial-and-error  process  can 
be  carried  out  without  too  much  difficulty,  for  a  straightforward  method 
exists  for  computing  the  OC  curve  for  any  binomial  test  of  the  sort  here 
described.  Thus,  by  determining  the  OC  curve  corresponding  to  each 
of  a  number  of  combinations  of  h2  and  s,  that  combination  can  be  chosen 
which  gives  the  requisite  probabilities  of  acceptance  at  pi  and  p2.  This 
procedure  will  yield  a  plan  for  the  experiment  which,  though  probably 
not  optimum  from  the  standpoint  of  minimizing  inspection,  will  serve  as 
an  approximation  to  the  type  of  plan  desired,  that  is,  lead  to  early 
abandonment  of  the  experiment  if  the  true  failure  rate  is  high;  moreover, 
the  OC  curve  will  be  available,  so  the  consequences  of  using  the  plan  will 
be  fully  known. 

3.  AN  EXAMPLE 

To  illustrate  the  method  of  solution  outlined  in  Sec.  2,  let  us  suppose 
that  if  as  many  as  half  the  mechanisms  are  failures  we  wish  a  high  degree 
of  assurance  of  abandoning  the  experiment  and  rejecting  the  proposed 
model,  if  as  few  as  40  percent  are  failures  we  wish  a  high  degree  of  assur- 
ance of  completing  the  experiment.  Let  the  desired  probability  of 
completing  the  experiment  if  the  failure  rate  is  as  high  as  50  percent  be 
/3  =  0.10,  and  the  probability  of  abandoning  the  experiment  if  the  failure 
rate  is  as  low  as  40  percent  be  a  =  0.10. 

The  lower,  or  acceptance  line  is  determined  by  the  requirements  for 
estimating  the  performance  parameters,  as  explained  above;  its  equation 
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is 

di  =  -32  +  N  (3) 

where  d\  represents  the  cumulated  number  of  failures  and  N  represents 
the  number  of  tests  made. 

The  first  approximation  to  the  rejection  line  is  given  by  the  one- 
decision  sequential  test  for  which  s  =  0.5  and  h2  is  obtained  from 

0.4  =  -1 (4) 


0.1 


Writing 

x  =  0.1    *«  (5) 


we  have 


Then,  from  (5). 


X0.5    _    J  I 

0A  =  T^X  =  i^TT  (6) 

x°-b  =  1.5  "                              (7) 

a;  =  2.25  (8) 

log  x  ^  ' 
1 


log  2.25 

=  2.84 


A  moment's  reflection  on  the  orientation  of  the  line  d2  =  2.84  +  0.57V 
on  the  graph  relating  number  of  failures  to  number  of  items  tested  shows 
that  so  long  as  s  =  0.5,  all  values  of  h  satisfying  the  inequality  2.5  <  h  <  3 
are  equivalent,  since  observed  values  of  d  and  N  must  be  integers.  In 
view  of  this,  and  because  we  expect  this  first  approximation  to  yield  too 
large  probabilities  of  acceptance,  we  place  the  initial  value  of  h2  at  2.5. 
The  first  approximation  to  our  test  is  thus  defined  by  the  two  straight 
lines  with  equations 

d2  =  2.5  +  0.5N         (rejection  line)         (10) 
di  =  —  32  +  N        (acceptance  line)         (11) 

However,  this  combination  fails  to  give  a  satisfactory  OC  curve,  the 
values  of  Lp  being  excessive  throughout  the  range  of  p.  On  the  other 
hand,  the  combination  h2  =  1.6  and  s  =  0.4  gives  values  of  Lp  that  are 
too  small,  and  so  also  do  the  combinations  h2  =  1.5,  s  =  0.5  and  h2  =  2.2, 
s  =  0.4.  A  fifth  set,  K2  =  2.5  and  s  =  0.4,  gives  the  following  OC 
curve,  which,  though  departing  from  that  originally  specified,  appears 
satisfactory. 
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4.  CALCULATION  OF  OC  CURVE 

The  method  of  calculating  the  OC  curve  for  any  particular  experi- 
mental plan  of  the  type  described  is  discussed  at  length  elsewhere,*  so 
it  will  be  outlined  here  only  in  general  terms. 

The  basis  of  the  method  is  the  fact  that  in  any  truncated  sequential 
test,  the  number  of  possible  outcomes  in  sampling  is  finite  and,  in  practical 
cases,  fairly  small.  It  is  therefore  possible  to  enumerate  all  possible  out- 
comes of  the  test  and  to  determine  the  total  probability  of  those  outcomes 
which  lead  to  acceptance,  that  is,  the  value  of  Lp. 

This  calculation  is  accomplished  step  by  step,  as  follows:  For  the  first 
stage  of  the  experiment,  all  possible  ways  in  which  the  sample  might 
turn  out  are  specified,  and  the  probability  of  each  outcome  is  computed 
on  the  assumption  that  the  true  failure  rate  of  the  mechanism  is  some 
particular  value,  say  pi.  These  outcomes  are  then  classified  into  two 
groups,  those  which  lead  to  acceptance  and  those  which  lead  to  the 
decision  to  continue  testing.  The  sum  of  the  probabilities  of  the  first 
class  gives  the  total  probability  of  acceptance  at  the  first  stage  of  the 
experiment.  Each  outcome  in  the  second  class  is  then  considered  as  a 
possible  point  of  departure  for  the  next  stage  of  the  experiment;  and  for 
each  such  point  of  departure,  the  possible  outcomes  of  the  second  stage 
are  considered  and  classified  as  before.  The  sum,  over  all  points  of 
departure,  of  the  probabilities  of  those  outcomes  which  lead  to  acceptance 
at  the  second  stage  then  gives  the  total  probability  of  acceptance  at 
stage  two,  while  the  sum  of  such  probabilities  over  both  stages  gives  the 
total  probability  of  acceptance  for  the  first  two  stages.  This  basic 
method  of  calculation  can  be  carried  through  the  remaining  (finite  num- 
ber of)  stages  of  the  experiment. 

The  total  probability  of  acceptance  taken  over  all  stages  represents 
the  probability  Lp  that  the  experiment  will  lead  to  acceptance  when  the 
true  failure  rate  is  pi.  By  systematic  methods  these  calculations  can  be 
duplicated  for  a  choice  of  different  values  of  p,  so  that  a  sequence  of 
values  of  p  and  Lp  sufficient  to  determine  the  entire  OC  curve,  or  at  least 
those  points  of  principal  interest  on  the  OC  curve,  can  be  determined.  It 
is  in  this  fashion  that  the  points  on  the  OC  curve  in  Sec.  3  were  determined. 

*  Statistical  Research  Group,  Sampling  Inspection,  Chap.  17,  Sec.  2,  pp.  187-202. 
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This  method  of  attack  yields  only  one  of  many  possible  tests  of  the 
sort  described,  all  having  the  same  probabilities  LPi  =  0.90  and  Lp  =  0.10. 
The  optimum  test  may  thus  be  viewed  as  the  particular  one  of  these 
tests  which  requires  the  smallest  average  amount  of  inspection  for  the 
relevant  range  of  p.  By  sufficient  calculation  this  test  might  be  deter- 
mined by  trial  and  error,  since  the  probabilities  used  to  determine  the 
OC  curve  of  a  test  suffice  to  calculate  the  curve  representing  the  average 
amount  of  sampling  as  a  function  of  the  failure  rate  p  (the  ASN  curve).* 
If,  therefore,  we  were  to  determine  a  whole  class  of  tests  with  the  same 
Lp  and  Lp  ,  we  could  choose  that  one  whose  ASN  curve  showed  the  least 
average  cost  in  sampling  in  the  relevant  range  of  p.  However,  the 
volume  of  calculation  necessary  to  use  this  approach  renders  it  impractical. 
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CHAPTER  13 
PLANNING  EXPERIMENTS  SEEKING  MAXIMA 

1.  TYPE  OF  PROBLEM 

This  chapter  deals  almost  entirely  with  a  special  class  of  experiments, 
namely,  those  designed  to  determine  the  conditions  for  which  some  func- 
tion attains  a  maximum.  For  example,  experiments  may  be  conducted 
on  the  breaking  strength  of  a  type  of  alloy  to  determine  the  composition 
producing  maximum  strength.  It  is  not  the  primary  object  of  such 
experiments  to  determine  the  functional  relation  between  strength  and 
composition,  nor  even  to  measure  the  strength,  maximum  or  otherwise; 
the  primary  object  is  to  find  a  particular  combination  of  the  independent 
variables  at  which  the  maximum  of  the  dependent  variable  is  nearly 
attained.  Although  the  planning  of  experiments  has  become  a  major 
branch  of  statistics,  little  attention  has  been  given  to  this  important  class 
of  problem,  which  is  especially  common  in  engineering  and  development 
work.  * 

For  concreteness,  this  chapter  considers  the  specific  problem  of 
planning  a  metallurgical  experiment  to  find  the  best  composition  of  a 
certain  type  of  alloy  for  maximizing  the  time  to  rupture  under  a  given 
stress  at  a  given  elevated  temperature.  Clearly,  the  principles  elucidated 
in  terms  of  this  particular  problem  are  applicable  to  a  wide  class  of 
problems. 

2.  PURPOSES  OF  EXPERIMENT 

The  problem  considered  is  one  in  which  a  single  observable  result 
(number  of  hours  it  takes  a  specimen  to  rupture)  is  a  function  of  many 
independent  variables  (method  of  treating  the  alloy,  chemical  composi- 
tion of  the  allojr,  and  so  forth).  An  experiment  is  to  be  conducted  in 
which  the  result  is  to  be  measured  for  a  series  of  values  of  the  independent 
variables. 

Such  an  experiment  is  likely  to  have  two  somewhat  different  pur- 
poses: (1)  To  find  the  values  of  the  independent  variables  that  make 
the  function  (that  is,  the  observable  result)  a  maximum ;  (2)  to  evaluate 
the  function  in  the  neighborhood  of  the  maximum.  At  the  present  stage 
of  metallurgical  knowledge,  finding  the  maximum  is  usually  the  primary 
purpose.     There  is  reason  to  believe  that  nothing  like  the  maximum 

*  The  pathbreaking  work  on  the  problem  is  that  of  Hotelling,  "  Experimental 
Determination  of  the  Maximum  of  a  Function. "      (For  full  citations,  see  Sec.  6.) 
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possible  strength  has  yet  been-  attained;  and  the  theory  of  the  subject 
is  so  undeveloped  as  to  make  direct  experiment  the  most  fruitful  approach 
toward  finding  stronger  alloys.  Moreover,  there  is  relatively  little 
interest  in  knowing  the  behavior  of  the  function  in  regions  rather  distant 
from  the  maximum. 

If  the  function  has  more  than  one  maximum,  the  ultimate  objective 
is  to  find  the  largest  maximum — the  absolute  maximum.  In  general, 
however,  this  objective  is  unattainable  without  complete  knowledge  of 
the  entire  function.  Such  knowledge  can  seldom  be  secured  experi- 
mentally, and  almost  certainly  not  in  a  single  experiment.  The  dis- 
cussion that  follows  therefore  assumes  a  more  modest  objective,  the 
determination  of  a  local  maximum. 

Although  the  location  of  a  maximum  is  of  primary  importance, 
mapping  the  function  in  the  neighborhood  of  a  maximum  is  important 
for  several  reasons: 

(1)  Typically,  maximization  of  the  particular  dependent  variable 
measured  in  a  given  experiment  is  only  one  of  several  desiderata  in 
choosing  an  alloy,  so  it  is  important  to  know  how  much  would  be  sacri- 
ficed if  the  independent  variables  were  altered  to  satisfy  other  require- 
ments. For  example,  the  combination  of  independent  variables  that 
gives  the  highest  time  to  rupture  may  give  insufficient  ductility.  It 
might  then  be  desirable  to  know  how  much  strength  would  be  sacrificed 
by  various  changes  designed  to  increase  ductility. 

(2)  Knowledge  of  the  relative  importance  of  different  independent 
variables  is  valuable  in  designing  other  experiments  to  find  other  maxima. 

(3)  Knowledge  of  the  function  in  the  region  of  a  maximum  may 
contribute  to  an  understanding  of  the  mechanism  at  work  and  may  thus 
advance  theoretical  knowledge. 

(4)  It  may  not  be  possible  to  control  the  independent  variables  as 
closely  in  production  as  in  laboratory  experiments.  Knowledge  of  the 
function  in  the  neighborhood  of  a  maximum  will  make  it  possible  to 
judge  the  seriousness  of  departures  from  control. 

3.  DEFICIENCIES  OF  FACTORIAL  DESIGN 

The  factorial  design  is  ideally  suited  for  experiments  whose  purpose 
is  to  map  a  function  in  a  preassigned  range  of  values  of  the  independent 
variables.*     The  fact  that  the  region  is  preassigned  means  that  it  is 

*  The  term  "factorial  design"  is  used  in  this  chapter  to  refer  to  an  experimental 
design  in  which  a  limited  number  of  levels  of  each  independent  variable  are  selected 
in  advance  and  at  least  one  test  is  made  for  each  combination  of  the  selected  levels  of 
the  independent  variables.  See  Fisher,  Design  of  Experiments,  Chap.  VI,  The 
Factorial  Design  in  Experimentation. 

The  factorial  design  is  in  a  sense  the  extreme  example  of  a  class  of  designs  having 
in  common  the  selection  in  advance  of  the  values  of  the  independent  variables  at  which 
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feasible  to  lay  out  the  design  completely  and  specifically  in  advance 
of  the  experiment;  the  fact  that  the  behavior  of  the  function  is  desired 
for  all  possible  combinations  of  the  independent  variables  means  that 
the  experiment  must  represent  all  combinations  in  some  way,  which  a 
factorial  design  does. 

For  the  purposes  described  in  the  preceding  section,  however,  the 
factorial  design  has  certain  deficiencies: 

(1)  It  devotes  observations  to  exploring  regions  that  may  turn  out, 
in  the  light  of  the  results,  to  be  of  no  interest  because  they  are  far  from  a 
maximum. 

(2)  Because  it  seeks  to  explore  a  given  region  comprehensively,  it 
must  either  explore  only  a  small  region,  or  explore  a  large  region  very 
superficially.  If  a  small  region  is  explored,  it  may  turn  out  to  contain  no 
local  maximum.  The  experiment  would  then  at  best  indicate  the  direc- 
tion in  which  to  move  in  conducting  a  new  experiment.  If  a  large 
region  is  explored,  it  may  be  explored  so  superficially  that  a  maximum 
may  be  missed  entirely,  or,  if  found,  will  be  known  so  imperfectly  that  an 
additional  experiment  is  required. 

(3)  The  factorial  design  has  the  disadvantage,  somewhat  less  funda- 
mental than  the  preceding,  that  it  neglects  the  fact  that  some  of  the 
variables  are  continuous  and  some,  though  discrete,  are  ordered.  This 
deficiency  could,  however,  be  remedied  while  retaining  the  basic  prin- 
ciples of  the  factorial  design. 

4.  A  SEQUENTIAL  DESIGN 

These  deficiencies  of  the  factorial  design  suggest  that  an  efficient 
design  for  the  present  purpose  ought  to  be  sequential;  that  is,  ought  to 
adjust  the  experimental  program  at  each  stage  in  the  light  of  the  results 
of  prior  stages.  This  section  outlines  a  particular  sequential  design, 
primarily  to  illustrate  the  direction  that  seems  desirable  in  the  design  of 
metallurgical  experiments  of  the  type  here  considered. 

Step  1.  Select  some  particular  combination  of  independent  variables 
as  a  starting  point.  The  combination  selected  should  be  the  best  advance 
estimate  of  the  optimum  combination. 

observations  are  to  be  made.  Other  designs  in  this  class  may,  in  any  particular 
instance,  be  more  economical  than  a  complete  factorial  and  preferable  to  it.  Economy 
can  be  gained  without  loss,  however,  only  if  additional  knowledge  is  available  in 
advance  (for  example,  knowledge  that  certain  interactions  are  negligible). 

Many  of  the  remarks  in  this  chapter  are  applicable  to  these  other  designs  as  well 
as  to  the  complete  factorial.  However,  the  chapter  considers  explicitly  only  the 
complete  factorial  design  in  order  to  have  a  sharp  and  definite  contrast  and  to  avoid 
complications  and  qualifications  that  have  little  bearing  on  the  main  issue  raised  in 
this  chapter. 
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Step  2.     Arrange  the  independent  variables  in  some  order. 

Step  3.  Make  tests  at  the  original  starting  point  and  at  a  series  of 
points  differing  only  in  the  value  assigned  the  first  independent  variable, 
the  other  variables  being  held  constant.*  From  this  series  of  experi- 
ments, estimate  the  value  of  the  first  independent  variable  that  maximizes 
time  to  rupture. f  If  the  independent  variable  is  continuous,  it  will 
probably  be  best  to  find  the  maximum  by  some  such  device  as  fitting  a 
polynomial  to  the  observations. 

Step  4.  Holding  the  first  independent  variable  fixed  at  the  maximiz- 
ing value  found  in  step  3  and  the  other  independent  variables,  except  the 
second,  at  the  values  originally  assigned  to  them,  repeat  step  3  with  the 
second  independent  variable.  Repeat  this  process  with  the  third,  fourth, 
.  .  .  ,  independent  variable. 

Step  5.  After  all  variables  have  been  varied  (call  this  a  " round"  of 
experiments),  the  whole  process  can  be  repeated,  except  that  the  original 
starting  point  is  replaced  by  the  combination  of  values  of  the  independent 
variables  reached  at  the  end  of  the  first  round. 

Successive  rounds  will  clearly  converge  to  the  neighborhood  of  a  local 
maximum,  with  perhaps  a  few  " pathological"  exceptions;  for,  aside  from 
statistical  fluctuations,  each  round  and  each  step  within  a  round  neces- 
sarily represent  an  improvement.  The  rate  at  which  the  improvements 
decline  from  round  to  round  is  likely  to  give  some  idea  of  how  much 
further  progress  is  possible  and  some  indication  of  the  value  of  additional 
rounds.  The  process  of  convergence  may  perhaps  be  speeded  up  after 
one  or  more  complete  rounds.     For  example,  after  the  second  complete 

*  For  the  present  purpose  the  precise  meaning  to  be  attached  to  "the  other  vari- 
ables being  held  constant"  is  irrelevant.  For  variables  representing  chemical 
composition,  it  can  be  interpreted  as  meaning  that  the  relative  proportions  of  the  other 
variables  are  held  constant.  Alternatively,  for  all  but  one  variable,  it  can  be  inter- 
preted as  meaning  that  the  percentage  a  particular  element  forms  of  the  total  is  held 
constant.  Any  combination  of  these  two  will  also  do,  provided  the  method  of  holding 
other  variables  constant  is  followed  consistently. 

f  If  the  independent  variable  is  ordered,  this  process  also  could  be  sequentialized, 
with  obvious  advantages.  For  example,  tests  could  be  made  (a)  at  the  original  start- 
ing point  and  (6)  at  a  single  other  value  of  the  first  independent  variable.  If  the 
tests  at  (6)  gave  a  (significantly)  higher  time  to  rupture  than  the  tests  at  (a),  other 
tests  would  be  made  by  moving  still  farther  in  the  same  direction  from  the  original 
starting  point.  If  the  tests  at  (6)  gave  a  shorter  time  to  rupture,  the  next  tests  would 
be  made  by  moving  in  the  opposite  direction  from  the  starting  point.  This  process 
would  be  continued  until  a  value  was  found  higher  than  its  neighbors  in  both  direc- 
tions. This  aspect  of  a  sequential  procedure  is  not  considered  in  detail  here  because 
it  is  not  germane  to  our  main  point  and  because  practical  considerations  having  to  do 
with  full  use  of  experimental  equipment  may  make  it  undesirable  to  sequentialize  the 
experiment  fully.  Even  though  this  phase  of  the  experiment  is  not  sequentialized, 
there  is  a  real  problem  about  designing  it  efficiently.  See  Hotelling,  "Experimental 
Determination  of  the  Maximum  of  a  Function." 
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round  it  might  be  desirable  to  move  along  the  vector  defined  by  the  end 
points  of  the  first  two  rounds,  instead  of  varying  the  variables  according 
to  the  original  pattern.  After  one  such  step,  the  original  pattern  could 
be  used  again. 

It  will  clearly  be  desirable  in  the  early  rounds  to  use  rather  broad 
intervals  in  varying  each  independent  variable  and  to  locate  the  maximum 
only  very  approximately.  Finer  and  finer  intervals  should  be  used  in 
later  rounds  as  the  maximum  is  approached.  A  consequence  of  this  is 
that  the  process  will  provide  only  a  very  rough  mapping  of  the  function 
in  regions  far  removed  from  a  maximum,  but  a  relatively  fine  mapping  in 
regions  near  a  maximum.  The  mapping  of  the  region  in  the  neighbor- 
hood of  a  maximum  can,  if  desired,  be  filled  in  by  a  supplementary 
experiment,  perhaps  of  factorial  design,  after  the  neighborhood  of  the 
maximum  has  been  delimited  with  sufficient  accuracy. 

It  may  be  worth  repeating  that  nothing  more  is  claimed  for  this  out- 
line of  a  sequential  design  than  that  it  is  suggestive.  Numerous  prob- 
lems would  arise  in  applying  it  to  an  actual  experiment,  and  there  are 
many  possibilities  of  increasing  its  efficiency.  The  suggested  sequential 
design  is  in  large  part  simply  an  abstract  and  systematic  statement  of  a 
rather  natural,  obvious,  and  common  method  of  locating  a  maximum  or 
minimum  of  a  function  of  many  variables. 

5.  COMPARISON  OF  FACTORIAL  AND  SEQUENTIAL  DESIGNS 
5.1.  Number  of  Observations 

The  great  advantage  of  a  sequential  design  over  a  factorial  design  is 
that  it  will  concentrate  observations  in  the  neighborhood  of  a  maximum. 
This  advantage  is  likely  to  be  greater,  the  larger  the  number  of  inde- 
pendent variables  and  the  larger  the  number  of  values  of  each  independent 
variable  at  which  observations  are  taken.  Let  n  be  the  number  of 
variables  and  suppose  that  observations  are  to  be  taken  at  k  values  of 
each  variable  (for  the  sequential  design,  these  need  not  be  the  same 
values  in  successive  rounds).  Then  a  complete  factorial  design  without 
replication  will  require  kn  observations.  A  single  round  of  the  sequential 
design  without  replication  will  require  kn  observations.  The  observa- 
tions required  for  a  factorial  design  will  therefore  permit  kn/kn  =  kn~1/n 
rounds,  which  increases  with  n  and  with  k.  The  number  of  rounds 
required  under  the  sequential  design  will  decrease  as  k  increases,  since  the 
maximum  at  each  step  would  be  determined  more  accurately.  We  do  not 
know  how  the  number  of  rounds  required  under  the  sequential  design 
varies  with  n — nor  indeed  is  it  easy  to  see  how  to  state  this  question 
precisely.  We  expect,  however,  that  the  number  of  rounds  required  will 
not  depend  very  sharply  on  n  but  will  depend  primarily  on  (a)  the  close- 
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ness  of  the  original  starting  point  to  the  optimum  combination  and  (b) 
the  size  of  the  experimental  error  relative  to  the  differences  it  is  consid- 
ered important  to  detect. 

Consider  a  simple  5X3X3X3  factorial  design,  that  is,  one  involv- 
ing four  independent  variables,  with  five  values  of  the  first  and  three 
values  of  each  of  the  others.  Such  a  design  requires  135  observations, 
without  providing  for  replications.  A  single  round  of  the  sequential 
design,  covering  the  same  number  of  values  of  each  independent  variable, 
would  require  14  observations.  It  would  therefore  be  possible  to  make 
almost  10  complete  rounds  with  the  number  of  observations  required  for 
the  factorial  experiment.  Fewer  than  10  rounds  should  suffice  if  the 
original  starting  point  is  reasonably  close  to  the  maximum.  More  rounds 
might  be  required  if  the  original  starting  point  were  far  from  a  maximum ; 
but  if  so  little  information  about  the  location  of  a  maximum  were  avail- 
able, a  factorial  experiment  might  well  prove  fruitless. 

As  another  example,  consider  an  experiment  to  determine  the  opti- 
mum composition  of  the  high-carbon  vitallium-type  alloys  with  respect 
to  elements  that  have  been  included  in  various  tests  of  this  type  of 
alloys.  At  least  ten  elements  have  been  included  in  one  or  another  trial 
composition.  This  gives  nine  independent  variables,  since  the  sum  of  the 
percentages  must  add  up  to  100.  Suppose  we  are  willing  to  use  only 
three  levels  of  each  element.  A  complete  factorial  design  without 
replications  would  then  require  39,  or  19,683,  observations.  A  single 
round  of  the  sequential  design  without  replications  would  require  only 
27  observations,  so  that  it  would  be  possible  to  make  729  rounds  with 
the  number  of  observations  required  for  a  complete  factorial  design. 
Five  rounds,  with  duplicate  observations  at  each  level  on  each  round, 
would  require  only  270  observations,  or  less  than  1.5  per  cent  of  the 
number  required  for  a  complete  factorial  design  without  replications. 
As  this  example  indicates,  the^  advantage  of  the  sequential  design  in 
terms  of  number  of  observations  is  likely  to  become  extremely  great  as  the 
number  of  variables  increases.  And,  it  should  be  noted,  the  metal- 
lurgical problem  in  question  does  in  fact  involve  many  variables.  The 
example  cited  is  certainly  conservative,  since,  in  addition  to  the  ten 
chemical  elements,  the  strength  of  the  alloy  is  likely  to  be  affected  by  the 
solution  and  aging  treatments,  the  methods  of  casting  or  of  forging,  and 
so  forth. 

It  should  be  recognized  that  the  additional  observations  obtained 
in  the  factorial  experiment  are  not  useless.  They  provide  information 
about  the  behavior  of  the  function  in  regions  removed  from  a  maximum, 
about  the  relative  importance  of  the  variables,  and  about  their  inter- 
actions. This  information  is  likely  to  be  valuable  in  choosing  new 
regions  within  which  to  search  for  maxima  and  in  deciding  on  the  vari- 
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ables  on  which  to  concentrate  in  preliminary  experimentation.  More 
important  still,  the  observations  some  distance  from  a  maximum  may, 
under  some  circumstances,  tell  something  about  the  behavior  of  time  to 
rupture  in  the  neighborhood  of  that  maximum;  perhaps  the  most  com- 
mon circumstances  when  this  occurs  are  when  (a)  interactions  are  small, 
or  (b)  there  is  available  a  fairly  good  approximation  to  the  mathematical 
form  of  the  function,  involving  relatively  few  parameters. 

With  respect  to  proviso  (a),  work  on  metallurgical  problems  suggests 
that  interactions  rarely  are  small.  Furthermore,  while  the  efficiency 
of  a  factorial  design  for  finding  a  maximum  is  greater  if  there  are  no 
interactions  than  if  there  are  interactions,  so  is  that  of  a  sequential 
design.  If  there  are  no  interactions,  a  single  round  of  the  sequential 
experiment  will,  except  for  statistical  fluctuations,  lead  directly  to  the 
maximum.  Statistical  fluctuations  mean  that  successive  rounds  will  not 
lead  to  exactly  the  same  point ;  but  they  will  tend  to  give  much  the  same 
time  to  rupture,  and  it  should  become  clear  fairly  quickly  that  one  is  in 
the  immediate  neighborhood  of  a  maximum.  Moreover,  in  the  absence 
of  interactions,  data  from  successive  rounds  can  be  combined  to  give  a 
better  estimate  of  the  maximum. 

Proviso  (6),  that  there  be  available  a  good  approximation,  involving 
relatively  few  parameters,  to  the  mathematical  form  of  the  function,  is 
rarely  satisfied.  But  if  it  is,  neither  the  standard  factorial  design  nor  the 
sequential  design  suggested  in  Sec.  4  is  appropriate.  The  problem  then 
is  to  determine  as  accurately  as  possible  the  values  of  the  independent 
variables  that  maximize  this  function.  These  maximizing  values  depend 
upon  the  parameters  of  the  function;  the  optimum  design  for  this  pur- 
pose thus  depends  on  the  form  of  the  function  and  on  the  approxi- 
mate values  of  its  parameters.  Probably  the  best  plan  is  to  conduct  a 
small  preliminary  experiment  to  get  a  general  idea  of  the  values  of  the 
parameters  and  to  use  this  information  in  designing  a  larger  experiment.  * 

5.2.  Time  Trends 

One  real  advantage  of  the  factorial  design  is  that  it  is  possible,  through 
proper  randomization,  to  prevent  the  introduction  of  bias  from  any  time 
trends  that  may  be  present.  Time  trends  will,  indeed,  reduce  the  power 
of  the  factorial  experiment  to  detect  the  effects  that  it  is  designed  to 
measure,  but  they  will  not  bias  the  conclusions.  The  sequential  plan,  on 
the  other  hand,  is  very  much  at  the  mercy  of  time  trends;  and  great  care 
would  have  to  be  taken  to  see  that  none  crept  in.  Measures  that  might 
be  adopted  would  include  (a)  running  control  specimens  of  constant 
composition  along  with  the  test  specimens;  (b)  keeping  a  running  record 

*  See  Hotelling,  "Experimental  Determination  of  the  Maximum  of  a  Function." 
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of  differences  between  duplicate  specimens  to  detect  changes  in  experi- 
mental error. 
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CHAPTER  14 

PROBABILITY  THAT  SAMPLE  MEANS  ARE  IN  OPPOSITE  ORDER 
TO  POPULATION  MEANS 

1.  INTRODUCTION 

When  N  objects  are  drawn  independently  at  random  from  each  of 
two  populations  in  which  some  measurable  characteristic  x  of  the  objects 
is  distributed  about  mean  values  mi  and  m2,  what  is  the  probability  that 
the  larger  of  the  two  sample  means  xi  and  x2  will  correspond  with  the 
population  having  the  smaller  mean?  In  other  words,  if  a  decision 
about  the  sign  of  mx  —  m2  is  based  on  the  sign  of  xi  —  x2,  what  is  the 
probability  that  the  decision  will  be  wrong? 

Such  a  question  arises  if  two  alternative  sources  Si  and  S2  (for  exam- 
ple, processes  for  making,  procedures  for  treating,  genetic  strains)  yield 
objects  of  a  certain  kind,  and  the  worth  of  each  object  is  determined  by 
the  value  of  some  measurable  characteristic  x  that  varies  about  an  aver- 
age mi  for  objects  from  Si  and  about  an  average  m2  for  objects  from  >S2. 
If  mi  and  m2  were  known,  a  decision  to  adopt  Si  or  S2  would  be  deter- 
mined (completely)  by  the  sign  of  mi  —  m2;  that  is,  Si  or  S2  would  be 
chosen  according  as  mx  exceeds  or  does  not  exceed  m2.  When  mx  and  m2 
are  unknown,  it  is  natural  to  base  the  decision  on  the  sign  of  xi  —  x2, 
where  Xi  and  x2  are  the  average  values  of  x  for  N  objects  drawn  inde- 
pendently at  random  from  Si  and  S2,  respectively.  Such  decisions  will 
be  wrong,  that  is,  opposite  to  the  decision  that  would  be  made  if  mx  and 
m2  were  known,  whenever  the  sign  of  xi  —  x2  is  opposite  to  that  of 
mi  —  m2.  Consequently,  it  is  of  interest  to  know  the  probability  of  such 
an  occurrence  and  how  this  probability  depends  on  the  sample  size  N 
and  on  the  absolute  magnitude  of  the  true  difference,  |rai  —  m2|. 

In  Sec.  2  the  probability  of  xi  —  x2  having  its  sign  opposite  to  that 
of  mi  —  m2  is  evaluated,  and  its  dependence  on  N  and  |rai  —  m2|  studied, 
for  the  case  where  x  is  normally  distributed  about  mi  with  a  standard 
deviation  of  <n  for  objects  from  Si,  or  about  m2  with  standard  deviation 
o-2  for  objects  from  S2.  In  Sec.  3  attention  is  given  to  the  question  of 
how  large  N  should  be  in  order  that  the  risk  of  a  wrong  decision  be  less 
than  /?  whenever  |m2  —  mi|  exceeds  D,  say.  The  results  given  in  Sees.  2 
and  3  are  applicable,  to  a  good  approximation,  in  cases  where  x  is  non- 
normally  distributed  for  objects  from  Si  or  S2,  provided  the  distributions 
of  x  have  finite  standard  deviations,  which  is  generally  the  case  in  prac- 
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tical  work,  and  N  is  sufficiently  large.  For  sample  sizes  of  4  or  larger 
the  approximation  will  generally  be  adequate  for  most  practical  purposes 
when  a i  and  a2  are  known.  When  <n  and  a2  are  unknown  and  N  is  not 
less  than  30,  <n  and  a2  may  be  replaced  by  their  estimates  from  the 
samples 


(1 4 


N(N  -  1) 


(«  =  1,  2) 


(1) 


and  the  results  of  Sees.  2  and  3  will  not  be  seriously  affected.  In  Sec.  4, 
the  conditions  under  which  the  sign  of  xi  —  x2  provides  the  "best" 
decision  function  for  choosing  between  Si  and  S2  are  stated,  and  the 
problem  of  choosing  a  best  decision  function  in  other  circumstances  is 
discussed  briefly. 


2.  PROBABILITY  FUNCTION 

Let  xi  and  x2  denote  the  means  of  N  observations  drawn  independently 
at  random  from  each  of  two  normal  populations  with  means  mi  and  m2 
and  standard  deviations  <j\  and  a2,  respectively.  Then  in  repeated 
sampling  x\  —  x2  will  be  normally  distributed  about  a  mean  mi  —  m2 

TABLE  14.1 

Probability  P(8,N)  That  Sample  Means  Will  Be  in  Opposite  Order  to 

Population  Means  for  Selected  Values  of  5  and  N 


Absolute  difference 

Nu 

mber  of  observations  in  each  sample 

{N) 

between  population 

means  in  a  units* 

(5) 

2 

4 

6 

8 

10 

12 

16 

20 

50 

100 

200 

0.00 

0.500 

0.500 

0.500 

0.500 

0.500 

0.500 

0.500 

0.500 

0.500 

0.500 

0.500 

0.05 

0.480 

0.472 

0.465 

0.460 

0.455 

0.451 

0.444 

0.437 

0.401 

0.362 

0.309 

0.10 

0.460 

0.444 

0.431 

0.421 

0.412 

0.403 

0.389 

0.376 

0.309 

0.240 

0.159 

0.15 

0.440 

0.416 

0.398 

0.382 

0.369 

0.357 

0.336 

0.318 

0.227 

0.144 

0.067 

0  20 

0.421 

0.389 

0.365 

0.345 

0.327 

0.312 

0.286 

0.264 

0.159 

0.079 

0.023 

0.25 

0.401 

0.362 

0.333 

0.309 

0.288 

0.270 

0.240 

0.215 

0.106 

0.039 

0.006 

0.30 

0.382 

0.336 

0.302 

0.274 

0.251 

0.231 

0.198 

0.171 

0.067 

0.017 

0.001 

0.40 

0.345 

0.286 

0.244 

0.212 

0.186 

0.164 

0.129 

0.103 

0.023 

0.002 

0.0000 

0.50 

0.309 

0.240 

0.193 

0.159 

0.132 

0.110 

0.079 

0.057 

0.006 

0.0002 

0.75 

0.227 

0.144 

0.097 

0.067 

0.047 

0.033 

0.017 

0.009 

0.0001 

0.0000 

1.00 

0.159 

0.079 

0.042 

0.023 

0.013 

0.007 

0.002 

0.001 

0.0000 

1.25 

0.106 

0.039 

0.015 

0.006 

0.003 

0.001 

0.0002 

0.0000 

1.50 

0.067 

0.017 

0.005 

0.001 

0.0004 

0.0001 

0.0000 

1.75 

0.040 

0.007 

0.001 

0.0002 

0.0000 

0.0000 

2.00 

0.023 

0.002 

0.0003 

0.0000 

2.50 

0.006 

0.0002 

0.0000 

3.00 

0.001 

0.0000 

*  Let  mi  and  <ri  denote  the  mean  and  standard  deviation  of  the  first  population,  mi  and  <n  denote  the 
mean  and  standard  deviation  of  the  second  population,  and  let  <r2  =  (<m2  +  <T22)/2;  then  5  =  \m\  —  mi\/a. 
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with  standard  deviation  a  v  2/N  where 

2  _  <r\  +  a\ 


(2) 


Consequently,  the  probability  that  the  sign  of  x\  —  %%  will  be  opposite 
to  that  of  mi  —  m2  is 


where 


and 


P(5,N)  =  -4=   f  °°  e   ~2dt 

V27T    J  a 


\nti  —  m2\ 


a=5Vf 


(3) 
(4) 
(5) 


P(5,N)  is  tabulated  to  3  decimals  (occasionally  4)  in  Table  14.1  for 
a  =  0  to  0.30  by  steps  of  0.05,  0.30  to  0.50  by  steps  of  0.10,  0.50  to  2.00 
by  steps  of  0.25,  2.00  to  3.00  by  steps  of  0.50;  and  for  N  =  2  to  12  by 


Probability 
.50 


.45 


.40 


,35 


.30 


.25 


20 


.15 


.10 


.05 


N   =  the  number  of  independent  observations  from 

each  population 
//,  =  mean  of  the  first  population 
uz  =  mean  of  the  second  population 
o",  =  standard  deviation  of  the  first  population 
crz  =  standard  deviation  of  the  second  population 

6  =         <r 
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~-u\ 
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Fig.   14.1. — Probability  that  means  of  samples  from  two  normal  populations  will  be  in 
opposite  order  to  population  means. 
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steps  of  2,  16,  20,  50,  100,  200.  For  example,  P(0.50,10)  =  0.132,  which 
signifies  that,  with  samples  of  10  observations  from  each  of  two  popula- 
tions whose  means  differ  by  a/2,  the  probability  of  the  sample  means 
being  in  opposite  order  to  the  population  means  is  0.132.  This  event 
may  be  expected  to  occur,  therefore,  more  than  1  time  out  of  8  in  the 
long  run,  under  such  circumstances.  Clearly  it  is  risky  to  base  decisions 
on  the  sign  of  xi  —  x2  for  samples  of  only  10  observations  each  when 
differences  of  a/2,  or  less,  between  population  means  are  of  practical 
importance. 

In  order  to  portray  better  the  dependence  of  P(8,N)  on  8  and  on  N, 
P(8,N)  has  been  plotted  against  8  in  Fig.  14.1  for  N  =  2,  4,  10,  20,  50, 
100,  and  200.  From  these  curves  it  is  clear  that  decisions  based  on  the 
sign  of  xi  —  x2  will  rarely  be  incorrect  when  |rai  —  m2\  exceeds  2a  (8  =  2), 
even  for  samples  of  only  4  observations  each.  On  the  other  hand,  if  an 
incorrect  decision  is  of  practical  importance  when  |mi  —  m2\  =  a/2,  and 
if  it  is  desired  that  the  probability  of  an  incorrect  decision  not  exceed 
0.01  under  these  circumstances,  then  about  50  observations  from  each 
population  will  be  needed  as  a  basis  for  decisions. 

3.  SAMPLE  SIZES 

If  we  choose  a  probability  &  of  making  an  incorrect  decision  (a 
decision  based  on  the  sign  of  Xi  —  x2  different  from  that  which  would  be 
made  if  the  sign  of  mi  —  ma  were  known)  and  define  Ke  by 

1      e    2dt  =  e  (6) 


V2tt  J  Kt 

then,  from  (3),  (4),  and  (5)  it  follows  that 

K^pi^!^  (7) 

determines,  for  a  given  N  and  a,  the  value  of  |rai  —  m2\  for  which  the  prob- 
ability of  an  incorrect  decision  is  j8;  or,  for  a  given  value  of  |mi  —  m2\/a, 
the  value  of  N  that  will  be  needed  if  the  probability  of  an  incorrect 
decision  is  to  be  /3.  Values  of  K$  can  be  obtained  from  Table  1.1  of 
Chap.  1,  or  values  of  0  corresponding  with  various  values  of  K  from 
Table  1.5. 

For  example,  if  an  incorrect  decision  is  serious  when  |mi  —  m2\  =  a/2, 
and  if  it  is  desired  to  limit  the  risk  of  such  an  error  to  0.01  under  these 
circumstances,  then  (7)  implies  that  at  least  44  (8  X  2.32632  =  43.3) 
observations  will  be  needed  from  each  of  the  populations.  On  the  other 
hand,  if  the  sampling  has  already  been  carried  out,  taking  only  30 
observations   from   each   population,   it   may   be   some   consolation   to 


Sec.  4  SAMPLE,  POPULATION  MEANS  IN  OPPOSITE  ORDER        381 

know  that  the  risk  of  an  incorrect  decision  is  only  0.05  when 

|mi  -  m2|  =  1.6449c/Vl5  =  0.42o- 
and  is  0.01  when 

|mi  -  m2\  =  2.3263d/ VT5  =  O.6O0-. 

4.  CONDITIONS  UNDER  WHICH  THE  SIGN  OF  xi  -  x2  PROVIDES  THE  BEST 

DECISION  FUNCTION 

In  Sec.  1  it  was  stated  that  if  a  decision  to  choose  Si  or  S2  would  be 
determined  (completely)  by  the  sign  of  m\  —  m2  if  this  sign  were  known, 
it  is  natural  to  base  the  decision  on  the  sign  of  X\  —  x2.  This  raises  the 
question  of  whether  the  sign  of  X\  —  x2  is  actually  the  best  function  of  the 
data  upon  which  to  base  the  decision. 

H.  A.  Simon  has  shown  that  when  (a)  x  is  normally  distributed  in 
each  of  the  populations  Si  and  S2,  (b)  the  standard  deviation  of  x  is  the 
same  (<ri  =  <r2)  in  $1  and  S2,  (c)  there  is  no  a  priori  basis  for  a  choice 
between  Si  and  S2  without  knowledge  of  the  values  of  mx  and  m2  (or,  at 
least,  of  the  sign  of  mi  —  m2),  and  (d)  the  choice  between  Si  and  >S2  is  to 
be  based  on  some  symmetric  function  of  the  observed  values  of  an  equal 
number  N  of  observations  drawn  independently  at  random  from  Si  and 
S2,  then  the  probability  of  an  incorrect  decision  will  be  a  minimum,  what- 
ever the  values  of  mi  and  m2  and  of  the  common  value  of  ci  and  a2,  if  the 
decision  to  choose  Si  or  S2  is  based  on  the  sign  of  Xi  —  x2.  As  we  have 
seen  above,  the  probability  of  an  incorrect  decision,  when  the  sign  of 
Xi  —  x2  provides  the  basis  for  decision,  is  determined  completely  by  N, 
a  =  <ti  =  c2,  and  \rrii  —  m2|  under  these  conditions. 

When  conditions  (a),  (c),  and  (d)  above  are  fulfilled,  but  cri  ^  <j2, 
then,  Simon  has  found,  there  does  not  exist  any  symmetric  function  of  the 
2N  observations  having  the  property  that  when  the  decision  to  choose 
between  Si  and  S2  is  based  on  the  observed  value  of  this  function  the 
probability  of  an  incorrect  decision  will  be  a  minimum  whatever  the 
values  of  mi  and  m2;  this  is  true  even  when  the  values  of  d  and  a2  are 
known.  This  is  the  case  a  fortiori  when  the  value  of  either  <ri  or  c2  is 
unknown.  He  has  found,  however,  that  if  it  is  agreed  that  (df)  the  choice 
between  $1  and  S2  is  to  be  based  on  some  symmetric  function  of  the 
differences  Xn  —  x2i,  Xi2  —  x22,  '  '  '  ,  %in  ~~  %2n,  then  the  sign  of 
Xi  —  x2  is  the  decision  function  that  minimizes  the  probability  of  error, 
whatever  the  values  of  mx  and  m2  and  of  0-1  and  a2. 

When  x  is  non-normally  distributed  in  one  or  both  of  the  populations, 
then  Xi  —  x2  no  longer  furnishes  the  most  efficient  estimate  of  mx  —  m2 
and  the  sign  of  Xi  —  x2  will  not  be  the  best  decision  function  upon  which 
to  base  a  choice  between  $1  and  S2,  even  when  ai  =  <r2.  Welch  has  dis- 
cussed the  determination  of  the  best  discriminant  function  upon  which 
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to  base  a  choice  between  two  populations  when  the  form  of  the  distribu- 
tion of  x  in  each  of  the  populations  is  known. 
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CHAPTER  15 

SIGNIFICANCE  OF  THE  LARGEST  OF  A  SET  OF 
SAMPLE  ESTIMATES  OF  VARIANCE 

1.  TYPE  OF  PROBLEM 

When  data  relate  to  several  different  processes,  to  several  different 
products,  to  a  single  process  at  different  times,  or  to  a  particular  type  of 
product  from  each  of  several  different  sources,  the  analyst  is  frequently 
faced  with  the  necessity  of  deciding  whether  the  observed  differences  in 
variability  of  the  data  within  the  respective  classes  are  consistent  with 
the  supposition  that  the  intrinsic  variability  is  uniform,  that  is,  the  same 
for  each  class. 

If  the  classes  are  not  uniform  in  variability,  then  simple  comparisons 
of  the  classes  in  other  respects  may  be  misleading.  Thus,  if  the  within- 
class  variability  differs  greatly  from  class  to  class,  the  mean  values  for 
the  respective  classes  of  the  variable  under  consideration  no  longer  pro- 
vide an  adequate  basis  for  comparison,  since  a  small  mean  associated  with 
a  large  dispersion  may  imply  greater  frequency  of  occurrence  of  undesir- 
able values  of  the  characteristic  than  will  a  larger  mean  associated  with  a 
smaller  dispersion  of  individual  values.  In  such  cases  it  may  be  more 
useful  to  compare  the  classes  with  respect  to  the  probability  of  occurrence 
of  an  individual  value  greater  than  a  specified  upper  limit  U  (as  in 
Chap.  1)  or  with  respect  to  some  other  feature  that  takes  into  account 
both  the  average  value  and  the  variability  of  the  characteristic  in  each 
class.  In  this  connection  it  should  be  recalled  that  the  customary  F  test 
of  the  homogeneity  of  class  means,  which  consists  of  comparing  the 
observed  ratio  of  the  between-class  mean  square  to  the  pooled  within-class 
mean  square,  is  valid  only  when  the  intrinsic  within-class  variability  is 
the  same  for  each  class.  Indeed,  a  pooled  estimate  of  within-class 
variability,  obtained  by  combining  measures  of  the  respective  within- 
class  dispersions,  may  be  useless  for  practical  purposes,  unless  there  is 
homogeneity  of  within-class  variability  or  the  method  of  combining  the 
within-class  dispersions  takes  proper  account  of  the  relative  importance 
(for  example,  the  relative  frequency  of  occurrence)  of  the  respective 
classes  under  realistic  conditions. 

Of  the  various  criteria  that  have  been  advanced  for  testing  the  homo- 
geneity of  a  set  of  variance  estimates,  the  test  proposed  by  Bartlett  is 
generally  considered  to  be  the  best  one  available  of  the   " omnibus" 
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variety.*  An  omnibus  test  is  one  that  has  good  discriminating  power 
with  respect  to  a  large  variety  of  alternatives  to  the  null  hypothesis. 
The  customary  F  test  of  the  homogeneity  of  a  set  of  means  is  often  used 
as  an  omnibus  test.  This  is  the  case  when  there  are  k  means  and  the 
between-means  mean  square  having  k  —  1  degrees  of  freedom  is  com- 
pared with  the  error  mean  square.  But  "the  statistician  who  does  not 
know  in  advance  with  which  type  of  alternative  to  [homogeneity]  he  may 
be  faced  is  in  the  position  of  a  carpenter  who  is  summoned  to  a  house  to 
undertake  a  job  of  an  unknown  kind  and  is  only  able  to  take  one  tool 
with  him!  Which  shall  it  be?  Even  if  there  is  an  'omnibus'  tool,  it  is 
likely  to  be  far  less  sensitive  at  any  particular  job  than  a  specialized  one; 
but  the  specialized  tool  will  be  quite  useless  under  the  wrong  conditions,  "f 
It  is  therefore  advisable  in  applications  of  the  F  test  to  isolate  and  test 
individually  the  mean  squares  corresponding  to  comparisons  that  have 
special  interest  a  priori,  or  that  represent  comparisons  between  rational 
subgroups  of  the  data  that  may  have  unequal  (population)  means  for 
various  a  priori  reasons.  By  thus  adapting  the  F  test  to  the  circum- 
stances of  the  study  involved,  its  discriminating  power  is  improved,  at 
least  with  respect  to  the  alternatives  to  homogeneity  that  are  believed 
to  be  the  most  likely  to  exist  or  to  be  of  special  importance  if  they  do 
exist.  Bartlett  has  pointed  out  that  his  over-all  test  for  homogeneity  of 
a  set  of  variances  can  be  separated,  in  similar  fashion,  into  parts  represent- 
ing comparisons  of  special  interest  between  subsets  of  variance  estimates, 
and  the  statistical  significance  of  each  part  tested  individually .  %  Like 
the  F  test  for  homogeneity  of  means,  Bartlett's  test  has  been  shown  to  be 
unbiased  and  to  possess  various  other  desirable  properties.  §  When  the 
issue  is  homogeneity  of  variances,  Bartlett's  test  assumes  the  role  played 
by  the  F  test  when  the  issue  is  homogeneity  of  means. 

Bartlett's  test,  though  applicable,  is  somewhat  inappropriate  when 
the  alternative  to  homogeneity  that  is  most  likely  to  represent  the  true 
situation,  or  that  is  of  special  importance  when  it  does  obtain,  is  that  the 
expected  value  of  some  one  of  the  mean  squares  (variance  estimates) 
substantially  exceeds  the  expected  values  of  the  others,  which  may  be 
homogeneous  or  not.     By  "some  one"  it  is  meant  that  prior  to  the 

*  Bartlett,  "Sufficiency  and  Statistical  Tests,"  especially  p.  273.  Also  see 
Bishop  and  Nair,  "Homogeneity  of  a  Set  of  Estimated  Variances";  Pitman,  "Loca- 
tion and  Scale  Parameters";  and  Hartley,  "Homogeneity  of  a  Set  of  Variances." 
Bartlett's  test  is  described  in  the  next  section  of  this  chapter;  also  in  Rider,  Statistical 
Methods,  Chap.  7,  Sec.  51,  where  an  illustration  is  given.  (For  full  citations,  see 
Sec.  5.) 

f  Pearson,  "Efficiency  of  Statistical  Tools,"  p.  316. 

X  Bartlett,  "Examples  of  Statistical  Methods,"  especially  pp.  158-159. 

§  Pitman,  "Location  and  Scale  Parameters,"  especially  Sees.  2  to  6;  also  Hartley, 
"Homogeneity  of  a  Set  of  Variances." 
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analysis  of  the  data — or,  better,  prior  to  the  collection  of  the  data — there 
is  no  basis  for  identifying  the  anomalous  one  of  the  set.  The  several 
variance  estimates  may  correspond,  for  example,  to  the  several  machines 
of  a  production  line  and  represent  the  within-machine  variability  of 
products  produced  by  the  respective  machines,  which  variability  may  be 
expected  to  be  the  same  from  machine  to  machine  unless  one  of  the 
machines  is  out  of  adjustment,  in  which  event  the  variability  of  products 
from  this  machine  will  exceed  that  of  products  from  other  machines. 
Similarly,  the  several  variance  estimates  may  correspond  to  samples 
taken  from  a  production  line  at  various  times  during  a  shift  and  may  be 
expected  to  be  homogeneous  if  the  process  is  in  control  but  to  have  a 
high  value  if  the  process  was  out  of  control  on  one  occasion.  In  situa- 
tions of  this  type  what  is  needed  is  a  test  of  the  statistical  significance 
of  the  amount  by  which  the  largest  of  the  variance  estimates  exceeds 
the  others.  Cochran  has  developed  such  a  test  and  has  provided  a  small 
table  of  5  percent  significance  levels  to  facilitate  its  use.  * 

Cochran's  test  for  the  statistical  significance  of  the  largest  of  a  set  of 
variance  estimates  is  described  and  its  use  illustrated  in  Sec.  3.  Table 
15.1,  which  gives  5  percent  levels  of  the  test,  is  an  extension  of  Cochran's 
table.  Table  15.2  gives  1  percent  levels,  which  were  not  tabulated  by 
Cochran.  In  view  of  the  approximations  used  in  their  computation, 
many  of  the  entries  in  these  tables  are  not  accurate  to  4  decimal  places, 
as  tabulated.  Their  accuracy  is  believed  to  be  sufficient  for  most  prac- 
tical purposes,  at  least  until  more  accurate  values  are  available.  For 
discussion  of  the  methods  of  computation  see  Sec.  4. 

2.  BARTLETT'S  TEST 

Let  s\,  si,  .  .  .  ,  si  be  k  variance  estimates  distributed  independently 
as  x2a2/n  for  rii,  n2,  .  .  .  ,  nk  degrees  of  freedom,  respectively,  and  s2  be 
the  pooled  variance  of  the  k  samples 

s>  =  ^  (1) 

where 

n  =  2n;  (2) 

Bartlett's  test  involves  computing 

B  =  ^  (  n  loge  s2  -  J.  n{  loge  s? J  (3) 

2.30259/    .  ■        V       i  2\ 

=  q — \n  logio  s2  —    }ni  logio  s\\ 

*  Cochran,  "Largest  of  a  Set  of  Estimated  Variances.'' 
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where 


71 

C  =  1+W^w  (4) 

Then  even  for  quite  small  values  of  the  ni}  say,  5  or  more,  B  has  approxi- 
mately the  x2  distribution  with  k  —  \  degrees  of  freedom.  Since  C  >  1, 
it  need  not  be  calculated  if  for  0  =  1  the  value  of  B  is  not  significant.  * 

3.  COCHRAN'S  TEST 
3.1.  Tables 

Let  s{,  s2,  .  .   .   ,  s£  be  k  variance  estimates  distributed  independently 
as  x20"2/w  for  n  degrees  of  freedom  each,  and  let 

largest  s2 


9       s\  +  sl+   ■   ■   ■   +sl  (5) 

be  the  ratio  of  the  largest  s2  to  their  total.  ■  Table  15.1  gives  values 
of  g  to  4  decimals  that  are  exceeded  with  probability  P,  where 
0.04875  <  P  <  0.05000,  f or  n  =  1  to  10,  16,  36,  144,  oo ,  and  k  =  2  to  10, 
12,  15,  20,  24,  30,  40,  60,  120,  oo.  Table  15.2  gives  values  of  g  to  4 
decimals  that  are  exceeded  with  probability  P,  where  0.00995  <  P 
<  0.01000,  for  the  same  values  of  k  and  n.  These  two  tables  thus  show 
the  5  percent  and  1  percent  levels  of  g.  They  are  exact  levels  (to  4-deci- 
mal  accuracy)  whenever  they  exceed  0.5000  and  in  many  other  instances 
also.  For  example,  a  calculation  of  Cochran's  implies  that  the  entries 
in  Table  15.1  for  n  <  10  and  k  <  10  are  exact  5  percent  levels  to  4-place 
accuracy.  Presumably  the  entries  of  Table  15.2  for  n  <  10  and  k  <  10 
are  likewise  exact  1  percent  levels  to  4-place  accuracy.  Values  of  n  and 
k  exceeding  10  were  selected  for  accuracy  in  interpolation.  Thus  n-wise 
interpolation  should  be  carried  out  in  terms  of  12/\/n;  k-wise  interpola- 
tion, in  terms  of  120 /k  or  60/ A;. 

3.2.  Example 

The  following  are  the  squares  of  sample  standard  deviations  obtained 
from  20  sets  of  six  strength  tests  of  cement  mortar,  each  set  being  obtained 
from  a  different  mixing  of  the  same  cement  mortar  under  similar  condi- 

*  Since  this  chapter  went  to  press,  Thompson  and  Merrington  have  provided 
"Tables  for  Testing  the  Homogeneity  of  a  Set  of  Estimated  Variances."  Writing 
(3)  above  as  B  =  M/C,  they  have  tabled  maximum  and  minimum  values  of  the  true 
5  percent  and  1  percent  significance  levels  of  M  for  k  =  3  to  15  and  for 

a  =  S(k  -  1)(C  -  1) 

from  0  to  5  by  steps  of  0.5;  5  to  10  by  steps  of  1;  12;  and  14. 
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tions.*    Are  these  20  squared  standard  deviations  homogeneous?     In 


Sample   Number 

(S.D.)2 

1 
271 

2 

389 

3 

617 

4 
107 

5 
1052 

6 
340 

7 
383 

8 
508 

9 

511 

10 

438 

Sample    Number 

(S.D.)2 

11 
280 

12 
209 

13 
936 

14 

281 

15 
346 

16 
383 

17 
492 

18 
512 

19 
462 

20 
349 

particular,  is  the  standard  deviation  computed  from  the  fifth  sample 
" out  of  line"? 

In  this  case  n  =  5,  k  =  20,  and  g  =  1,052/8,866  =  0.1187,  which  is 
less  than  the  corresponding  5  percent  level  (0.1735)  given  in  Table  15.1. 
The  anomalous  value  of  the  standard  deviation  from  the  fifth  sample  may, 
therefore,  be  attributed  to  chance,  and  the  20  sets  of  strength  tests  may 
be  regarded  as  homogeneous  with  respect  to  variability  of  the  strength 
determinations. 

The  squared  standard  deviations  given  above  correspond  to  the 
definition 


(Sample  standard  deviation) : 


£    (Xi  -  X)' 


where  x  =  sample  average 


6 
=  6  2/  **' 


the  Xi  (i  =  1,  2, 


(6) 


6)  being 


the  6  individual  strength  determinations  of  the  sample  concerned,  in 
pounds  per  square  inch.  Consequently,  each  value  is  h/§  of  the  cor- 
responding variance  estimate  s2,  defined  by 


y  (xi  —  x)' 

i  =  l 


(7) 


However,  the  factor  %  is  common  to  the  numerator  and  denominator 
of  g,  so  need  not  be  taken  into  account  explicitly. 


4.  COMPUTATION  OF  TABLES 

??,  si,  .  .  .  ,  si  are  variance  estimates 
distributed  independently  as  x2o"2/^  for  n  degrees  of  freedom,  and  if 


Cochran  has  shown f  that  if  s\,  st>, 


9i  = 


4  +  sl  + 


+  4 


(i  =  1,  2, 


,k) 


(8) 


*  Davies  and  Pearson,  "Estimating  from  Samples,"  Table  IV,  p.  92;  a  descrip- 
tion of  the  tests  appears  on  p.  77. 

t  Cochran's  r  and  n  correspond  to  our  n  and  k,  respectively. 
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then  the  probability  that  the  largest  of  the  ratios  (8)  exceeds  g  is  given  by 

P(g)  -  kPM  -  *fi^Pi(0  +  Kh  ~  if  ~-2) PM  -  •  •  • 

+  (-l)*-1P*(?)     (9) 

where  P\{g)  is  the  probability  that  a  specified  one  of  the  ratios  exceeds  g, 
P2(#)  the  probability  that  a  specified  pair  of  the  ratios  both  exceed  g,  and 
so  on.     Clearly 

Pk(g)  =  0         when  g  >  i  (10) 

for  it  is  certain  that  the  largest  ratio  will  be  as  great  as  1/k;  so  the  number 
of  non-zero  terms  in  (9)  is  the  greatest  integer  less  than  1/g. 
Cochran  has  shown  that 

f1     V:-i  n(fc-l)     1 

Pi(g)  =  "■  x2   r(1  ~.f    '  -,    -       o  <  g  <  l  (ii) 

n  n(k  -  1) 

La*  — 2 — 

and  has  given  Pi(g)  and  Ps(^)  in  multiple  integral  form,  from  which  the 
multiple  integral  form  of  P3(g),  j  =  4,  5,  •  •  •  ,  Jc,  is  apparent.  He  has 
pointed  out  that,  since  P2G7)  <  [Pi(^)]2  for  all  values  of  n  and  k,  the 
upper  o;  levels  of  significance  of  g  can  be  evaluated  to  a  good  first  approxi- 
mation using  only  the  first  term  of  (9).     Thus,  if  ga  is  determined  from 


it  follows  from  (9)  that 


«('-?) 


Pi(S.)  =  I  (12) 


<  a  -  ^r a2  <  p(^«)  -  a  (13) 


that  is,  upper  5  percent  levels  of  g  determined  from  (12)  will  in  all  cases 

correspond  to  0.04875  <  P  <  0.05000;  and,  in  view  of  (10),  will  be  exact 

when  they  exceed  J^.     Similarly,  upper  1  percent  levels  determined 

from  (12)  will  correspond  to  0.00995  <  P  <  0.01000  and  will  be  exact 

when  they  exceed  J^- 

From  (11)  it  follows  that 

ga(n,k)  =  1  -xa  (14) 

k 

F 

k 


Fa  +  (*  -  1) 
k 
where 


Xa 

k 


=  xi  [y  !^1)]  (15) 
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is  the  lower  a/k  level  of  significance  of  the  beta-function  distribution,  and 

Fa  =  Fa[n,ri(k  -  1)]  (16) 

k  h 

is  the  upper  a/k  level  of  significance  of  the  F  distribution.  Hence 
go.o&(nfk)  can  be  obtained  from  the  tables  of  Thompson,  or  from  those  of 
Thompson  and  Merringtori,  for  k  =  2,  5,  and  10;  and  go.oi(n,k)  can  be 
obtained  similarly  for  k  =  2.  The  values  of  #0.05  given  in  Table  15.1  for 
k  =  2,  5,  and  10,  and  of  #o.oi  given  in  Table  15.2  for  k  —  2,  were  obtained 
in  this  manner. 

When  n  is  an  even  integer,  (11)  can  be  reduced  to  a  polynomial  in  g. 
Thus,  when  n  =  2 

JMflO  =  (i  -  g)k~l  (17) 

and  when  n  =  4 

Pi(g)  =  (1  -  <7)2*-2[l  +  2(*  -  l)g]  (18) 

as  Cochran  has  pointed  out.  The  corresponding  polynomials  for  n  =  6, 
8,  and  10  were  determined.  The  values  of  #0.05  and  (70.01  given  in  Tables 
15.1  and  15.2  for  n  =  2,  4,  6,  8,  and  10  were  obtained  with  the  aid  of 
these  polynomials.  In  all  other  cases  the  values  of  {70.05  and  <7o.oi  were 
evaluated  from  0.05/fc  and  0.01/Zs  levels  of  F,  which,  in  turn,  were  deter- 
mined by  means  of  a  modification  of  a  procedure  due  to  Paulson. 

It  is  believed  that  as  estimates  of  #0.05  and  $0.01  the  entries  of  Tables 
15.1  and  15.2  are  correct  to  within  1,  or  possibly  2,  units  in  the  fourth 
decimal  place.  The  accuracy  of  the  tabular  entries  as  estimates  of  the 
true  5  percent  and  1  percent  levels  of  g  is  in  general  unknown,  although 
it  is  known  that  the  tabular  values  tend  to  exceed  the  true  values.  It  is 
believed,  however,  that  in  all  cases  the  tabular  5  percent  levels  correspond 
to  0.04875  <  P  <  0.05000  and  the  tabular  1  percent  levels  to  0.00995 
<  P  <  0.01000.  With  this  understanding,  Tables  15.1  and  15.2  should 
be  useful  in  practical  work  until  more  accurate  tables  become  available. 
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CHAPTER  16 
INVERSE  SINE  TRANSFORMATION  OF  PROPORTIONS 

1.  HISTORY  OF  THE  TRANSFORMATION 

If  successive  samples  of  n  items  each  are  drawn  independently  at 
random  from  a  very  large  (infinite)  population  in  which  a  proportion  P 
of  the  items  possess  some  definite  attribute  A,  then  the  proportions 
Ph  Ph  P3j  •  •  •  °f  items  possessing  A  in  the  successive  samples  will  be 
distributed  in  such  a  manner  that  as  the  number  of  independent  samples 
is  increased  without  limit  the  average  of  the  p's  will  tend  to  P  and  the 
mean  of  their  squared  deviations  from  P  will  tend  to  P(l  —  P)/n.  In 
the  language  of  statistics  this  is  expressed  by  saying  that,  if  a  sample  of  n 
items  is  drawn  at  random  from  an  infinite  population  in  which  a  propor- 
tion P  of  the  items  possess  a  definite  attribute  A,  and  if  the  proportion  of 
items  possessing  A  in  the  sample  is  denoted  by  p,  then 

Expected  value  of  p  =  E(p)  =  P  (1) 

and 

Variance  of  p  -  V(p)  =  P(1  -  P)  (2) 


or 


-  **  -  V" 


Standard  deviation  of  p  =  <rp  =  A/ (3) 


Equation  (3)  or  equation  (2),  considered  in  conjunction  with  equa- 
tion (1),  states  that  observed  proportions  p,  based  on  successive  inde- 
pendent random  samples  of  size  n,  may  be  expected  to  be  grouped  more 
closely  about  the  true  proportion  P  when  n  is  large  than  when  n  is  small ; 
for  fixed  sample  size,  their  sampling  variation  about  P  will  be  greatest 
when  P  equals  Jl2  and*  will  decrease  toward  zero  as  either  P  or  1  —  P 
approaches  zero.  This  dependence  of  the  sampling  variation  of  an 
observed  proportion  p  upon  the  true  proportion  that  it  estimates,  P,  is 
a  source  of  inconvenience  in  certain  kinds  of  theoretical  work  and  in  some 
types  of  statistical  analysis  of  research  data.  Basically,  the  incon- 
venience is  a  consequence  of  the  fact  that  for  purposes  of  statistical 
inference  (for  example,  estimation  of  parameters,  tests  of  agreement 
between  observation  and  hypothesis)  the  worth  or  " weight"  of  an  indi- 
vidual observation  (here,  a  single  observed  proportion  p)  is  generally 
inversely  proportional  to  its  sampling  variance.     Accordingly,  the  for- 

397 
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mulas  of  statistical  theory  and  the  calculations  of  statistical  analysis  are 
simpler,  in  general,  when  the  observations  involved  are  of  equal  weight, 
that  is,  of  equal  precision. 

Recognizing  this  fact,  R.  A.  Fisher,  in  a  paper  on  theoretical  genetics 
published  in  1922,*  introduced  the  transformation  cos  <p  =  1  —  2p, 
where  <p  varies  from  0  to  ir  as  p  varies  from  0  to  1,  and  utilized  the  fact 
that,  when  <p  is  computed  from  large  samples,  the  variance  of  <p  is,  for  all 
practical  purposes,  independent  of  the  true  proportion  P. 

Fisher  employed  this  transformation  to  simplify  the  mathematical 
equations  occurring  in  an  application  of  statistical  theory  to  a  problem  in 
genetics.  In  1930  he  published  another  paper  on  this  problemf  in 
which  he  considered  the  bias  of  the  transformation  in  large  samples 
and  showed  that  his  failure  to  make  allowance  for  this  bias  in  his  earlier 
paper  accounted  for  the  discrepancy  between  one  of  the  results  therein 
and  a  solution  to  the  same  problem  obtained  in  a  different  manner  by 
Sewall  Wright.  Not  until  1936  were  the  possibilities  of  this  transforma- 
tion in  connection  with  statistical  analysis  of  research  data  discussed  in 
print,  when  three  writers  indicated  advantages  to  be  gained  from  its  use 
in  statistical  analyses  of  data  expressed  in  the  form  of  observed  propor- 
tions (or  percentages  of  items  counted).  Bartlett,{  on  a  suggestion  from 
Yates,  Fisher's  coworker  at  the  Rothamsted  Experimental  Station, 
considered  the  use  of  a  related  transformation  (sin2  6  =  p,  where  6  varies 
from  0  to  7r/2;  see  below)  as  a  means  of  rendering  proportions  amenable 
to  analysis-of-variance  techniques,  which  assume  equal  precision  of  the 
observations  involved.  With  the  same  objective,  Bliss, §  after  some 
correspondence  with  Fisher,  recommended  the  use  of  the  angular  trans- 
formation sin2  6  =  p  in  the  statistical  analysis  of  field-trial  data  expressed 
in  percentages  (that  is,  100  times  the  proportion  of  items  with  a  given 
attribute  in  a  sample)  and  gave  an  extensive  table  of  6  (in  degrees)  for 
percentages  (that  is,  lQOp)  from  0.01  to  99.99.  Zubin||  pointed  out  the 
advantages  of  using  Fisher's  transformation  as  a  means  of  simplifying 
the  calculations  involved  in  testing  the  statistical  significance  of  a  differ- 
ence between  two  observed  proportions,  his  attention  having  been 
drawn  to  Fisher's  papers  by  Hotelling.  Zubin,  utilizing  this  transforma- 
tion, subsequently  prepared  nomographs  1f  by  means  of  which  the  statis- 
tical significance  of  the  difference  between  two  observed  proportions  can 
be  determined  almost  at  a  glance.  These  nomographs  jointly  are,  in  a 
sense,  the  ancestors  of  the  nomograph  presented  in  Chap.  7  above. 

*  Fisher,  "On  the  Dominance  Ratio,"  p.  326.     (For  full  citations,  see  Sec.  5.1.) 

f  Fisher,  "The  Distribution  of  Gene  Ratios." 

%  Bartlett,  "Square  Root  Transformation  in  Analysis  of  Variance,"  p.  73. 

§  Bliss,  "The  Analysis  of  Field  Experimental  Data  Expressed  in  Percentages." 

||  Zubin,  "Note  on  a  Transformation  Function  for  Proportions  and  Percentages." 

If  Zubin,  "Nomographs  for  Determining  the  Significance  of  the  Difference." 
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Since  1936  a  great  deal  has  been  written  on  the  transformation  of 
proportions  to  angles,  to  facilitate  or  validate  various  forms  of  statistical 
analysis.  A  fairly  complete  bibliography  is  given  at  the  end  of  this 
chapter;  it  includes  references  to  tables  that  have  been  published  to 
facilitate  the  angular  transformations  in  current  use. 

The  remainder  of  this  chapter  is  devoted  to  a  discussion  of  the  effec- 
tiveness of  Fisher's  transformation  in  small  samples,  especially  the 
dependence  on  P  of  the  expected  value  and  the  variance  of  <p.  A  simple 
adjustment,  due  to  Bartlett,*  is  discussed,  and  a  table  (Table  16.4)  of 
adjusted  values  provided  to  facilitate  its  use.  More  elaborate  adjust- 
ments, determined  by  the  data  themselves  in  any  given  case,  have  been 
discussed  by  Cochran,  f  and  tables  J  are  available  to  facilitate  their  use. 
In  the  last  section  of  this  chapter  the  general  method  of  determining 
transformations  to  stabilize  variance  is  outlined,  the  angular  transforma- 
tion of  Fisher  derived,  and  several  other  such  transformations  noted.  A 
method  of  discovering  Bartlett's  adjustment  is  noted,  which  may  be  use- 
ful in  other  situations. 

2.  NATURE  OF  THE  TRANSFORMATION 

Let  p  denote  the  proportion  of  items  possessing  some  particular  attri- 
bute in  a  sample  of  n  items  drawn  independently  at  random  from  a  very 
large  (infinite)  population  of  items  in  which  a  proportion  P  of  the  items 
possess  the  attribute  in  question.  Under  these  circumstances  the  mean, 
variance,  and  standard  deviation  of  p  are  functions  of  P,  given  by  equa- 
tions (1),  (2),  and  (3).  Let  <p  be  an  angle  (in  radians)  determined  by  the 
transformation 

(p  =  2  arcsin  Vp  —  arccos  (1  —  2p)         0  <  p  <  1         0  <  <p  <  ir      (4) 
the  inverse  of  which  is 

v  =  gin*  |  =  1  ~  ^°S  g         0  <p  <  1         0  <  <p  <  it  (5) 

Then,  when  the  number  of  observations  n  upon  which  p  is  based  is  large, 

Expected  value  of  <p  =  E(<p)  (6) 

=  <p(P)  approximately 


where 


<p(P)  =  2  arcsin  \/P  (7) 


*  Bartlett,  "Some  Examples  of  Statistical  Methods,"  p.  168. 
t  Cochran,  "The  Analysis  of  Variance  When  Experimental  Errors  Follow  the 
Poisson  or  Binomial  Laws,"  pp.  342  ff. 
%  Fisher  and  Yates,  Statistical  Tables, 
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(8) 


An  elementary  derivation  of  this  transformation  is  given  in  Sec.  4.  This 
is  the  form  in  which  the  transformation  was  introduced  by  Fisher  in  1922; 
the  transformation  has  been  employed  in  this  form  by  Zubin  and,  with 
certain  modifications,  by  the  Statistical  Research  Group,  f 

Other  writers,  following  Bartlett  and  Bliss,  have  transformed  p  to  an 
angle  6  equal  to  <p/2,  whence  V(0)  =  l/4n  approximately  for  0  in  radians 
or  821/n  approximately  for  6  in  degrees.  Thus,  when  0  is  expressed  in 
radians,  V(0)  has,  for  a  wide  range  of  values  of  P,  approximately  the 
value  of  V(p)  when  P  =  %.  This  advantage,  if  such  it  is,  is  lost,  how- 
ever, when  0  is  expressed  in  degrees. 

TABLE  16.1 

Means  and  Variances  of  Various  Functions  of  a  Sample  Proportion    for 

Selected  Values  of  the  Population  Proportion  When  the  Sample  Size 

Is  10 


p 

<P(P) 

E{p) 

40V  (p) 

E(<p) 

10V(<p) 

E(<pB) 

107(^0 

0.0500 

0.4510 

0.0500 

0.1900 

0.2856 

1.2917 

0.4757 

0.4481 

0.1000 

0.6435 

0.1000 

0.3600 

0.5134 

1 . 6567 

0.6241 

0.7489 

0.1500 

0.7954 

0.1500 

0.5100 

0.7006 

1.6529 

0.7632 

0.9363 

0 . 2000 

0.9273 

0.2000 

0.6400 

0 . 8598 

1.5332 

0 . 8939 

1.0436 

0.3000 

1.1593 

0 . 3000 

0 . 8400 

1 . 1260 

1.2968 

1.1350 

1.1224 

0.5000 

1.5708 

0.5000 

1.0000 

1.5708 

1 . 1433 

1.5708 

1.1258 

0.7000 

1.9823 

0.7000 

0 . 8400 

2.0156 

1.2968 

2.0066 

1 . 1224 

0.8000 

2.2143 

0.8000 

0 . 6400 

2.2818 

1.5332 

2.2477 

1.0436 

0.8500 

2 . 3462 

0.8500 

0.5100 

2.4410 

1.6529 

2 . 3784 

0.9363 

0.9000 

2.4981 

0 . 9000 

0 . 3600 

2 . 6282 

1 . 6567 

2.5175 

0.7489 

0 . 9500 

2 . 6906 

0.9500 

0.1900 

2 . 8560 

1.2917 

2.6659 

0 . 4481 

Let 


Explanation 
P  denote  the  proportion  of  items  possessing  a  definite  characteristic  in  an 

infinite  population 
p  denote  the  proportion  of  items  possessing  this  characteristic  in  a  sample  of 
10  items  drawn  independently  at  random  from  the  population 
Then  <p(p)  =  2  arcsin  -\/p  =  arccos  (1  —  2p),  0<  p  <  1,  0  <  <p  <  ir 


<pe{p)  =  <p(p)  for  0  <  p  <  1,  <fB(0)  =  ip(Ho),  <pb(1)  =  tt 


»(0) 


and  E{     )  denotes  average  or  expected  value  of  the  quantity  within  the  parentheses 
V{     )  denotes  variance  of  the  quantity  within  the  parentheses. 


*  If  <p  is  measured  in  degrees 

Vf  s  -  /180V  1  _ 


3282.8 


f  Sequential  Analysis,  Sec.  3.6;  also  Chap.  7  of  this  volume. 
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3.  EFFECTIVENESS  OF  THE  TRANSFORMATION 

Apparently  the  only  published  investigation  of  the  effectiveness  of  the 
inverse  sine  transformation  in  small  samples  is  that  of  Bartlett*  for  the 
case  of  n  =  10.  His  analysis  shows  that  when  n  =  10,  the  variance  of  <p 
depends  very  markedly  on  the  true  proportion  P.  A  year  later  he 
remarked f  that  a  good  correction  is  to  replace  p  =  0  by  p  =  l/4n,  and 
p  =  lbyp  =  l  —  l/4n,  although  he  gave  no  supporting  evidence. 

Tables  16.1  to  16.3  show,  for  n  —  10,  20,  and  30,  respectively,  the 
values  to  4  decimal  places  of  E(p),  AnV(p),  <p(P),  E(<p),  nV(<p),  E(<pB), 
and  nV(<pB)  corresponding  to  P  =  0.05  to  0.20  by  steps  of  0.05,  0.30, 
0.50,  0.70,  and  0.80  to  0.95  by  steps  of  0.05,  where  yB  denotes  the  angular 
transformation  (4)  modified  as  suggested  by  Bartlett,  that  is, 


<Pb(p)  =  2  arcsin  \/p  =  arccos  (1  —  2p)         0<p<l         0<<pB<ir 
<pB(0)  =  2  arcsin  \/l/4n,         <pb{1)  =  tt  —  <pb(0) 


(9) 


TABLE  16.2 

Means  and  Variances  of  Various  Functions  of  a  Sample  Proportion  for 

Selected  Values  of  the  Population  Proportion  When  the  Sample  Size 

Is  20 


p 

<P-(P) 

E(p) 

807(p) 

E(<p) 

20V(<p) 

E(<pb) 

20V  (<pB) 

0.0500 

0.4510 

0.0500 

0 . 1900 

0.3541 

1.6315 

0.4344 

0.7247 

0 . 1000 

0.6435 

0 . 1000 

0 . 3600 

0 . 5866 

1.5458 

0.6139 

1.0138 

0.1500 

0 . 7954 

0.1500 

0.5100 

0.7602 

1.3240 

0.7688 

1.0974 

0.2000 

0.9273 

0.2000 

0.6400 

0.9035 

1.1810 

0.9061 

1.1051 

0.3000 

1.1593 

0 . 3000 

0 . 8400 

1 . 1469 

1.0858 

1.1471 

1.0784 

0.5000 

1 . 5708 

0.5000 

1.0000 

1.5708 

1.0562 

1 . 5708 

1.0561 

0 . 7000 

1.9823 

0.7000 

0 . 8400 

1.9947 

1.0858 

1.9945 

1.0784 

0.8000 

2.2143 

0.8000 

0 . 6400 

2.2381 

1.1810 

2.2355 

1.1051 

0 . 8500 

2 . 3462 

0.8500 

0.5100 

2.3814 

1.3240 

2.3728 

1.0974 

0.9000 

2.4981 

0.9000 

0 . 3600 

2 . 5550 

1 . 5458 

2 . 5277 

1.0138 

0.9500 

2 . 6906 

0.9500 

0.1900 

2.7875 

1.6315 

2.7072 

0.7247 

Let 


Explanation 
P  denote  the  proportion  of  items  possessing  a  definite  characteristic  in  an 

infinite  population 
p  denote  the  proportion  of  items  possessing  this  characteristic  in  a  sample  of 
20  items  drawn  independently  at  random  from  the  population 
Then  <p(p)  =  2  arcsin  -y/p  =  arccos  (1  —  2p),  0  <  p  <  1,  0  <  <p  <  ir 
<pb(p)  =  <p(p)  for  0  <  p  < 1,  <pb(0)  =  <p(Ho),  <pb(1)  =  tt  -  <pB(0) 
and  E(     )  denotes  average  or  expected  value  of  the  quantity  within  the  parentheses 
V(     )  denotes  variance  of  the  quantity  within  the  parentheses. 

*  Bartlett,  "Square  Root  Transformation  in  Analysis  of  Variance,"  pp.  74-75. 
t  Bartlett,  "Some  Examples  of  Statistical  Methods,"  p.  168. 
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The  mean  values  and  variances  of  <p  and  cpB  were  determined  by  direct 
calculation.* 

If  the  inverse  sine  transformation  completely  stabilized  variance  in 
small  samples,  then  nV(<p)  would  be  unity  in  all  cases,  or  at  least  would 
be  constant  for  a  given  value  of  n.  That  this  is  not  the  case  is  clear 
from  Tables  16.1  to  16.3.  Evidently,  nV((pB)  is  somewhat  less  variable 
than  nV(cp)  over  the  range  of  values  of  P  considered.  The  nature  of 
the  dependence  of  nV(<p)  and  nV(<pB)  on  P  is  depicted  graphically  in 
Figs.  16.1  to  16.3,  for  n  =  10,  20,  and  30,  respectively.  For  purposes  of 
comparison  4nF(p)  has  been  plotted  also.  The  following  conclusions 
may  be  drawn: 

(1)  For  n  >  10  and  0.05  <  P  <  0.95,  nV(<p)  and  nV(<pB)  are  less 
variable  than  4nV(p). 

TABLE  16.3 

Means  and  Variances  of  Various  Functions  of  a  Sample  Proportion  for 
Selected  Values  of  the  Population  Proportion  When  the  Sample  Size 

Is  30 


p 

<p(P) 

E(v) 

120F(p) 

E(<p) 

307(<?) 

E(<pB) 

30V  (<pB) 

0.0500 

0.4510 

0.0500 

0 . 1900 

0.3885 

1 . 6443 

0.4278 

0.8985 

0  1000 

0.6435 

0.1000 

0.3600 

0.6119 

1 . 3359 

0.6197 

1.0921 

0.1500 

0.7954 

0.1500 

0.5100 

0.7757 

1.1560 

0.7771 

1.0987 

0.2000 

0.9273 

0 . 2000 

0 . 6400 

0.9134 

1.0885 

0.9136 

1.0773 

0 . 3000 

1.1593 

0.3000 

0.8400 

1.1516 

1.0484 

1.1516 

1.0481 

0.5000 

1.5708 

0.5000 

1.0000 

1.5708 

1.0358 

1.5708 

1.0358 

0.7000 

1.9823 

0.7000 

0 . 8400 

1.9900 

1.0484 

1.9900 

1.0481 

0.8000 

2.2143 

0 . 8000 

0 . 6400 

2.2282 

1.0885 

2 . 2280 

1.0773 

0 . 8500 

2 . 3462 

0.8500 

0.5100 

2.3659 

1.1560 

2.3645 

1.0987 

0.9000 

2.4982 

0.9000 

0 . 3600 

2 . 5297 

1.3359 

2.5219 

1.0921 

0.9500 

2.6806 

0 . 9500 

0.1900 

2.7531 

1.6443 

2.7138 

0 . 8985 

Explanation 

Let         P  denote  the  proportion  of  items  possessing  a  definite  characteristic  in  an 
infinite  population 
p  denote  the  proportion  of  items  possessing  this  characteristic  in  a  sample  of 
30  items  drawn  independently  at  random  from  the  population 
Then  <f{p)  =2  arcsin  -y/p  =  arccos  (1  —  2p),  0  <  p  <  1,  0  <  <p  <V7r 

<pb(p)  =  <p(p)  for  0  <  p  <  1,  <pB(0)  =  <p(K2o),  <pb(1)  =  w. —  <pb(0) 
and  E(     )  denotes  average  or  expected  value  of  the  quantitv  within  the  parentheses 
V(     )  denotes  variance  of  the  quantity  within  the  parentheses. 


*  Binomial  probabilities  to  7  decimals  were  obtained  from  Tables  of  the  Incomplete 
Beta  Function;  arcsin  values  to  6  decimals  were  obtained  by  interpolation  in  Hayashi's 
Sieben-und  mehrstellige  Tafeln,  the  entries  of  this  latter  table  being  corrected  in  some 
cases  in  accordance  with  the  list  of  corrections  given  in  Mathematical  Tables  and  Other 
Aids  to  Computation,  Vol.  1,  No.  6,  p.  197. 
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(2)  For  n  >  20  and  0.05  <  P  <  0.95,  nV(<pB)  is  less  variable  than 
nV(<p). 

(3)  For  n  >  10,  and  0.075  <  P  <  0.925,  nV(<pB)  is  less  variable  than 
nV(<p). 

Since  it  is  quite  apparent  from  the  foregoing  that  the  inverse  sine 
transformation,   especially  with  Bartlett's  correction,  is  advantageous 

Let  P  denote  the  proportion  of  items  in  on  infinite  population  that  possess  a  given  characteristic; 

p  denote  the  proportion  of  items  possessing  the  characteristic  among  the  10  items  drawn 
independently  at  random  from  the  population; 

<t>(p)  =  2arcsin  Vp  =  arccos(1-2pj; 

<t>B(p)  =  0(p),  O  <p<1;  <DB(0)^  <D(1/40);   <t>B(1 )  =  IT  -  <t>g(0); 

then   V1  =  4P(1-P)  =  4x10xV(p) 
V2=  10  V(<P) 

V3=10V(OB) 

V 
2.0 


v2  ^ 


1.0 


Fig.  16.1. — Dependence  of  the  variances  of  three  functions  of  a  sample  proportion  p  on  the 
population  proportion  P  when  the  sample  size  is  10. 


from  the  viewpoint  of  variance  stabilization,  its  effectiveness  in  this 
regard  increasing  rapidly  as  the  sample  size  n  is  increased,  the  question 
arises  whether  estimates  of  P  obtained  by  transforming  back  to  propor- 
tions after  averaging  in  angular  units  will  be  biased.  A  comparison 
of  the  values  of  <p(P),  E(<p),  and  E(<pB)  shown  in  Tables  16.1  to  16.3 
reveals  that  E(<p)  and  E((pB)  generally  are  less  than  <p(P)  when  P  <  J^; 
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greater  than  <p(P)  when  P  >  }i;  and  the  three  are  equal  when  P  =  3^-* 
Furthermore,  for  all  values  of  n  and  P  considered  in  these  tables,  E(ipB) 
is  the  closer  to  <p(P).  From  Tables  16.1  to  16.3  it  is  seen  that,  when 
0.05  <  P  <  0.95,  the  bias  E(<pB)  -  <p(P)  is  less  than  15  per  cent  of  the 

Let  P  denote  the  proportion  of  items  in  an  infinite  population  that  possess  a  given  characteristic; 

p  denote  the  proportion  of  items  possessing  the  characteristic  among  the  20  items  drawn 
independently  at  random  from  the  population; 

<t>(p)  =  2arcsin  Vp  =  arccos(1-2p); 

<t>B(p)=<t>(p),  0<p«=1;  OB(0)=  0(1/80);   0g(1)  =  IT-  <t>B(0); 
then   V1  =  4P(1-P)  =  4x20x  V(p) 
V2=20  V(O) 


1.8 


1.6 


1.4 


1.2 


1.0 


2  \ 

/Vj 


0 
Fig. 


1.0 


16.2. — Dependence  of  ^he  variances  of  three  functions  of  a  sample  proportion  p  on  the 
population  proportion  P  when  the  sample  size  is  20. 

standard  deviation  of  ipB  when  n  >  10  and  thus  is  of  negligible  magnitude 
for  most  practical  purposes.  Thus  Bartlett's  correction,  in  addition  to 
its  advantage  from  the  viewpoint  of  stabilizing  variance,  tends  to  reduce 
to  negligible  magnitudes  the  biases  of  estimates  of  P  obtained  by  recon- 
verting averages  expressed  in  angular  units  to  estimated  mean  propor- 
tions with  the  aid  of  equation  (5). 

*  The  values  of  E(<ps)  for  P  =  0.05  and  P  =  0.95  when  n  =  10  are  exceptions. 
Such  exceptions  can  be  expected  to  occur  with  <pb  in  those  instances  in  which  the  p  =  0 
or  v  =  1  class  has  a  large  probability  of  occurrence,  that  is,  when  P  is  close  to  0  or  1 
and  n  is  small. 
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Values  of  <pb(0)  and  <pb(1),  expressed  in  radians,  have  been  tabulated  in 
Table  16.4  for  sample  sizes  from  10  to  50,  in  order  to  facilitate  the  use 
of  Bartlett's  correction.  The  values  of  <pB(0)  and  <pB(l)  given  in  Table 
16.4  may  be  used  in  conjunction  with  Zubin's  " Table  for  Transforming 
Proportions  to  T  values"*  without  modification;  they  should  be  divided 

Let  P  denote  the  proportion  of  items  in  an  infinite  population  that  possess  a  given  characteristic; 

p  denote  the  proportion  of  items  possessing  the  characteristic  among  the  30  items  drawn 
independently  at  random  from  the  population; 

O(p)  =  2arcsin  Vp  -  arccos(1-2p); 

«>B(p)=<t>(p),0<p«=1;  0>B(0)=4> (1/120);    Og(1 )  =  ir-  <t>B(0); 

then   V1=4P(1-P)  =  4x  30xV(p) 


2.0 
1.8 
1.6 
1.4 
1.2 
1.0 
.8 
.6 


| . p-y 

A  I  '  I  I  /  


0  .1  .2  .3  .4  .5  .6  7  P  ?  1.0 

P 
Fig.   16.3. — Dependence  of  the  variances  of  three  functions  of  a  sample  proportion  p  on 
the  population  proportion  P  when  the  sample  size  is  30. 


by  2  before  being  used  in  conjunction  with  Bartlett's  tablet;  they  should 
be  multiplied  by  28.64789  before  being  used  in  conjunction  with  the 
tables  of  Bliss,  Fisher  and  Yates,  and  Snedecor. 

A  previous  publication  of  the  Statistical  Research  Group  gives  a 
tablef  of  the  transformation 

*  Zubin,  "Note  on  a  Transformation,"  Table  4,  p.  220. 

t  Bartlett,  " Subsampling  for  Attributes,"  Table  II,  p.  132. 

X  Sequential  Analysis,  Table  3.64,  p.  3.57.  "  *    '  ' 
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y(s,n)  —  2  arcsin 

y(0,n)  =  V2/n 

y(n,n)  =  w  —  \/2/n        for  s  =  n 


<  n,  0  <  y  < 


(10) 


for  values  of  n  from  10  to  50.  Comparison  of  the  foregoing  with  equation 
(9)  above  reveals  that  y(s)  =  <pB(p)  with  p  =  s/n  for  0  <  s  <  n; 
y(0)  =  \/2<pb(0)  approximately  and  y(n)  =  tt  —  \f2<pB(0)  approxi- 
mately. Since  the  publication  of  this  table  of  the  y  transformation, 
it  has  been  found  that  Bartlett's  adjustments,  <pB(0)  and  <pB(l),  are 
slightly  more  effective  for  stabilizing  variance,   especially  when  P  is 

TABLE  16.4 
Values  of  ^s(O)  and  <pb0-)  for  Samples  of  10  to  50  Items  Together  with  Values 
of  the  Variance  of  <pB  When  P  =  0.5 


Sample 

<pb(P), 

wd), 

Vo.h(<PB)t 

Sample 

<pb(0), 

Ml), 

V0.b(<pB) 

size,  n 

radians 

radians 

radians2 

size,  n 

radians 

radians 

radians2 

10 

0.3176 

2.8240 

0.1126 

30 

0.1828 

2.9588 

0.0345 

11 

0.3027 

2.8389 

0.1011 

31 

0.1798 

2.9617 

0.0334 

12 

0.2897 

2.8519 

0.0918 

32 

0.1770 

2.9646 

0.0323 

13 

0.2782 

2.8633 

0.0840 

33 

0.1743 

2.9673 

0.0313 

14 

0.2681 

2.8735 

0.0775 

34 

0.1717 

2 . 9699 

0.0303 

15 

0.2589 

2 . 8827 

0.0719 

35 

0.1692 

2 . 9724 

0.0294 

16 

0.2507 

2.8909 

0.0670 

36 

0.1669 

2.9747 

0.0286 

17 

0 . 2431 

2.8985 

0.0628 

37 

0.1646 

2.9770 

0.0278 

18 

0.2363 

2.9053 

0.0591 

38 

0.1624 

2.9792 

0.0270 

19 

0.2299 

2.9117 

0.0558 

39 

0.1603 

2.9813 

0.0263 

20 

0.2241 

2.9175 

0.0528 

40 

0.1583 

2.9833 

0.0257 

21 

0.2187 

2.9229 

0.0501 

41 

0.1563 

2.9853 

0.0250 

22 

0.2136 

2.9280 

0.0477 

42 

0.1545 

2.9871 

0.0244 

23 

0.2089 

2.9327 

0.0456 

43 

0.1526 

2.9889 

0.0238 

24 

0 . 2045 

2.9371 

0.0436 

44 

0.1509 

2.9907 

0.0233 

25 

0 . 2003 

2.9413 

0.0417 

45 

0.1492 

2.9924 

0.0227 

26 

0.1964 

2.9452 

0.0401 

46 

0.1476 

2.9940 

0.0222 

27 

0.1927 

2.9488 

0.0385 

47 

0.1460 

2.9956 

0.0217 

28 

0.1893 

2.9523 

0.0371 

48 

0.1445 

2.9971 

0.0213 

29 

0.1860 

2.9556 

0.0358 

49 

0 . 1430 

2.9986 

0.0208 

50 

0.1415 

3.0001 

0.0204 

Explanation 
Let  P  denote  the  proportion  of  items  possessing  a  definite  characteristic  in  an 

infinite  population 
p  denote  the  proportion  of  items  possessing  this  characteristic  in  a  sample 
of  n  items  drawn  independently  at  random  from  the  population 
Then  <pb(p)  =  2  arcsin  y/p  =  arccos  (1  —  2p),  0  <  p  <  1,  0<^<ir 

<Pb(0)  =  2  arcsin  y/\/4n\  <pb(1)  =  tt  —  ?B(0) 
and     V0.b{<pB)  denotes  the  variance  of  <pb  when  P  =  0.5. 

Note:  Whenn  >  50,  <pB(Q)  =  y/l/4n,  <Pb(1)  =  tt  -  <^(1),  and  V0.s(<pb)  =  1/n  +  1/n2 
are  correct  to  within  1  unit  in  the  fourth  decimal  place. 
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near  0.05  or  0.95,  than  the  simpler  adjustments  y(0,ri)  and  y(n,n). 
It  is  recommended,  therefore,  that,  when  special  accuracy  is  desired, 
the  values  of  <pb(0)  and  <pb(1)  given  in  Table  16.4  of  the  present  chapter  be 
used  in  place  of  the  values  of  y(0,n)  and  y(n,n)  shown  in  the  table  of  the 
y  transformation. 

From  Figs.  16.1  to  16.3  it  is  clear  that,  for  a  fairly  wide  range  of  values 
of  P  about  P  =  0.5,  the  variance  of  <pB,  nV((pB)  being  plotted  as  F3  in 
these  figures,  is  quite  close  to  its  value  for  P  =  0.5,  say  Vo.5(<pb),  and 
that  Vo.b(<pB)  approaches  \/n  as  n  increases.  Indeed,  from  a  considera- 
tion of  these  three  cases  (n  =  10,  20,  30)  it  appears  that 

v       ".        1       0.974       3.08       2.4 

Using  this  formula,  values  of  Vo.z(<pB)  were  computed  for  intermediate 
values  of  n,  that  is,  for  10  <  n  <  20  and  20  <  n  <  30,  and,  in  the 
absence  of  any  other  formula  giving  correct  values  for  n  =  10,  20,  and 
30,  this  formula  was  used  to  compute  values  of  V0.b((pB)  for  n  from  31 
to  50.  These  values  are  tabulated,  together  with  the  known  values  for 
n  =  10,  20,  and  30,  in  Table  16.4  and  provide,  it  is  believed,  a  reasonably 
accurate  picture  of  the  convergence  of  V(<pB)  toward  its  theoretical  value 
of  X/n.  The  values  of  Vo.5(<pb)  given  in  Table  16.4  are  slightly  larger  than 
the  values  of  the  variance  of  y  given  in  the  table  of  the  y  transformation 
referred  to  above,  but  for  n  >  25  the  excess  is  very  slight,  being  roughly 
1  unit  or  less  in  the  fourth  decimal  place. 

Fisher  and  Yates  have  suggested  that,  when  special  accuracy  is 
desired,  observed  angles  be  replaced  by  adjusted  angular  values  in  much 
the  same  manner  as  observed  pr obits  are  replaced  by  adjusted  pr obits. 
They  have  given  a  table  to  facilitate  such  adjustments.*  The  adjust- 
ments proposed  by  Fisher  and  Yates  are  functions  of  the  actual  propor- 
tions observed  in  a  given  instance  and  thus  are  of  a  wholly  different  type 
from  Bartlett's  adjustment,  which  may  be  regarded  as  a  convenient  rule 
of  thumb.  The  theoretical  basis  of  the  adjustments  advocated  by 
Fisher  and  Yates  has  been  given  in  a  recent  paper  by  Cochran,  f 

Cochran  has  given  an  excellent  discussion  {  of  the  presentation  of 
results  of  statistical  analyses  in  which  transformations  of  original  data 
have  been  employed  to  facilitate  their  statistical  analysis.  Curtiss§  has 
provided  a  general  mathematical  theory  of  transformations  of  data  to 
render  their  analysis  more  tractable. 

*  Fisher  and  Yates,  Statistical  Tables,  Table  XIV. 

f  Cochran,  "The  Analysis  of  Variance  When  Experimental  Errors  Follow  the 
Poisson  or  Binomial  Laws,"  p.  342. 

J  Cochran,  "Analysis  of  Replicated  Experiments,"  p.  168  and  p.  170. 
§  Curtiss,  "On  Transformations." 
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In  conclusion,  it  must  be  remembered  that  while  in  large  samples  the 
variance  of  a  transformed  proportion  is  independent  of  the  true  propor- 
tion for  most  practical  purposes,  the  variance  of  the  transformed  value  <p 
is  still  proportional  to  1/n,  where  n  is  the  number  of  observations  upon 
which  the  observed  proportion  is  based.  Consequently,  if  pi,  p2,  ps,  .  .  . 
are  proportions  based  on  different  numbers  of  observations  ni,  n2,  n3,  .  .  . 
and  if  the  n's  differ  widely,  then  the  variances  of  the  corresponding  angular 
values  may  be  so  unequal,  because  of  the  variation  of  the  n's,  that  the 
advantages  of  the  transformation  are  lost  almost  entirely.  Similarly,  the 
advantages  of  the  inverse  sine  transformation  may  be  lost  to  a  large  extent 
when  the  data  to  be  analyzed  are  subject  to  other  forms  of  variability  in 
addition  to  sampling  variation  so  that  the  variance  of  an  observed  pro- 
portion p  is  given  by  P(l  —  P)/n  +  <r2e,  where  the  first  term  represents 
the  variation  of  simple  sampling  and  the  second  term  the  extraneous 
variation.  In  a  recent  paper*  Cochran  has  discussed  the  analysis  of 
fractions  or  percentages  based  on  different  numbers  of  observations  (a) 
when  extraneous  variation  is  absent  and  (b)  when  extraneous  variation 
is  present. 

4.  TECHNICAL  NOTES 
4.1.  General  Transformation  to  Stabilize  Variance 

4.1.1.  Basic  Theorem. — The  transformation  of  random  variables  to 
stabilize  variance  may  be  based  upon  the  following  theorem : 

Let  0,  (0i  <  6  <  02),  be  a  parameter  of  a  statistical  universe  (pop- 
ulation), and  let  T  be  an  estimator  of  0,  [Ti(0)  <  T  <  T2(0)],  based  on 
n  observations  from  the  universe  such  that 

Expected  value  of  T  m  E(T)  =  0 

Variance  of  T  =  V(T)  =  E(T  -  0)2  =  ^ 

Third  moment  of  T  about  0     =  ^{T)  =  E(T  -  0)3 

(12) 

as  n  — >  qo  |  ' 


4) 


Fourth  moment  of  T  about  0  =  m(T)  =  E(T  -  0)4 

as  n 


-® 


*  Cochran,  "Analysis  of  Variance  for  Percentages  Based  on  Unequal  Numbers." 
f  By  <p(n)  =  o(nk)  as  n  — ►  °o  we  mean  that  lim  <p(n)/nk  =  0.     In  particular,  o(l) 

n— >  oo 

means  a  function  that  tends  to  zero  as  n  — *  °° . 

By  <p(ri)  =  0(nk)  as  n  — >  «  we  mean  that  \<p{n)\  <  K  •  nk  for  n  >  n0,  where  K 
is  finite  and  does  not  depend  upon  n,  and  n0  is  some  (large)  value  of  n.  In  particular, 
0(1)  means  a  function  that  is  bounded  as  n  — >  w . 

This  notation  is  in  agreement  with  that  introduced  by  E.  Landau  in  1909  and  is 
now  well  established. 
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If  the  function 


kt)  =  f  -  r;_  n:s» 


dT 

Vg(T) 

and  its  first  two  derivatives  exist  for  all  admissible  values  of  T,  except 
perhaps  for  a  set  of  measure  zero,  and  if  the  mean  value  of  its  second 
derivative  and  of  the  square  of  the  second  derivative  are  finite  for 
0i  <  0  <  62  and  every  value  of  n,  that  is,  if 

E\f»(T)]  =  0(1) 
and 

E\P\T)Y  =  0(1)  (14) 

as  n  ->  oo  for  0i  <  0  <  02  and  7\(0)  <  T  <  T2(d),  then 


i  +  .ft 


The  first  of  assumptions  (12)  implies  that  T  is  an  unbiased  estimator 
of  0.  The  three  further  assumptions  are  introduced  for  the  purposes  of 
the  theorem  and  are  not  very  restrictive.  Indeed,  assumptions  (12)  are 
generally  satisfied  when  T  is  some  sort  of  average  that  has  been  adjusted 
so  as  to  be  an  unbiased  estimator  of  a  population  parameter  of  interest. 

4.1.2.  Proof  of  Basic  Theorem. — Let  f(T)  be  a  function  of  T  such 
that  its  second  derivative  f(2)(T)  exists  for  7\(0)  <  T  <  T2(d),  except 
perhaps  for  a  set  of  points  of  measure  zero,  where  7\(0)  <  T  <  T2(6) 
is  the  admissible  range  of  values  of  T,  that  is,  P[Ti(B)  <  T  <  T2(0)|0]  =  1. 
Since  E(T)  =  0  by  assumption,  it  follows  that  0  lies  in  the  admissible 
range  of  values  for  T.  Hence  f(T)  may  be  expressed  as  follows,  with 
the  aid  of  Taylor's  formula  and  Lagrange's  form  of  the  remainder  term : 

KT)  =  /(»)  +  ^^/(1'«0  +  (T  ~  9)2/(2)(0  (16) 

where  t  =  6  +  %T(T  —  0),  0  <  £r  <  1,  and  the  notation  £r  is  used  to 
emphasize  the  fact  that  the  value  of  £  depends  upon  the  value  of  T  con- 
cerned, Ti(0)  <  T  <  7*2(0). 

Taking  the  mathematical  expectation  of  each  side  of  equation  (16), 
it  is  found  that 

E[f(T)]=f(d)+^-E[f^(t)]  (17) 

=  /(0)  +  o(l)         as  n  ->  oo 

by  virtue  of  assumption  (14)  and  the  second  of  assumptions  (12).  This 
shows  that  these  assumptions,  together  with  those  regarding  the  exist- 
ence and  differentiability  of  f(T),  are  sufficient  to  ensure  that,  as  n 
increases  without  limit,  the  expected  value  of  the  function  f(T)  converges 
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stochastically  to  /(0),  the  value  of  the  function  when  T  assumes  its 
expected  value. 

Subtracting  the  left  and  right  sides  of  (17)  from  the  left  and  right 
sides,  respectively,  of  (16),  squaring,  and  taking  expected  values,  it  is 
found  that 


V[f(T)}  =  E{f{T)  -E[f{TW  (18) 

/(2)Wl2M4(r) 
4 

J(1)(0)-£[/(2)(OWT) 


=  [/">(*)]»  •  V(T)  +  ^E^P\t)WKT)  +  E[f^(t)]^(T) 


+ 


In  view  of  the  assumptions  (12)  and  (14),  this  becomes 

V[f(T)}  =  [/<»(0)2  •  V(T)  +  o  (~j        as  n  -  oo  (19) 

=  [/<■>(«)]* -^  +"Q)        as«^« 

Consequently,  for  the  function  defined  by  equation  (13),  equation  (19) 
reduces  to  equation  (15).     This  completes  the  proof. 

Curtiss*  has  considered  somewhat  more  fully  the  mathematical 
aspects  of  transformations  of  random  variables  to  simplify  their  variances, 
and  his  paper  should  be  consulted  by  persons  interested  in  a  compre- 
hensive mathematical  treatment. 

4.2.  Applications  of  the  General  Procedure 

4.2.1.  Proportion  of  Items  Possessing  a  Particular  Attribute  in  a 
Sample. — Let  6  =  P,  the  proportion  of  items  in  an  infinite  population 
that  possess  a  particular  characteristic  A,  and  let  T  =  p,  the  proportion 
of  items  possessing  A  in  a  sample  of  n  items  drawn  independently  at 
random  from  the  population.     It  is  well  known  that 

E(p)  =  P 

V(p)  =  P(1~P)  (20) 

M3(p)  =  p«  -  *&  ~  g) 
,   N       3P2(1  -  PY    ,   P(.l  -  P)(l  -  6F  +  6P2) 

M4(P)     = o H " — 5 " 

Clearly  ns(p)  and  ^(p)  are  both  o(l/n)  as  n  — >  oo ,  and  g(P)  =  P{\  —  P). 
Consequently,  if  there  is  a  transformation  that  will  stabilize  the  variance, 
the  theorem  implies  that  it  will  be 

/(p)  =    f  dv  =  2  arcsin  Vp  +  C  =  arccos  (1  -  2p)  +  C     (21) 

y  vp(i  -  p) 

*  Curtiss,  "On  Transformations  Used  in  the  Analysis  of  Variance." 


Sec.  4.2.2     INVERSE  SINE  TRANSFORMATION  OF  PROPORTIONS       411 

where  C  is  an  arbitrary  constant,  which  may  conveniently  be  taken  as  0. 
This  yields  the  transformation  <p(p),  denned  in  equation  (4),  <p(p)  being 
expressed  in  radian  measure.  It  is  not  difficult  to  show  that  the  second 
derivative  of  <p{p)  satisfies  the  requirements  of  the  transformation 
theorem  for  0  <  P  <  1,  whence 


n  \nj 


V[<p(v)]  =  ~  +  o^-J         as  n  ->  oo  (22) 

4.2.2.  Mean  of  a  Sample  from  a  Poisson  Exponential  Population. — 

Let  0  =  n,  the  parameter  (mean)  of  a  Poisson  exponential  population, 
and  let  T  =  x,  the  mean  of  a  random  sample  of  n  independent  observa- 
tions from  this  population.     It  is  well  known  that 

E(x)  =  M 


V(x) 

-  t 
n 

Ms(£) 

n2 

fii(x) 

(23) 


n6 


Clearly  ^(x)  and  /Z4(#)  are  both  o(l/n)  asn->  oo ,  and  g(fi)  =  \x.  Conse- 
quently, the  theorem  implies  that,  if  there  exists  a  transformation  that 
will  stabilize  the  variance,  it  will  be 

J{x)  =    [~  =  2V*  +  C  (24) 

J  Vx 

It  is  convenient  to  take  C  =  0.  Bartlett*  chose  to  drop  the  2  also 
and  recommended  the  transformation  y  =  V^-  The  first  and  second 
derivatives  of  y  with  respect  to  x  are 


and 


V'  =  — ^=         V"  =  tt=L  (25) 

2yx  4(x)3/2  v     - 


E(y')  =  —  approximately 

2  Vm 

#(2/")  =  4-p^  approximately  (26) 


[E(y")]2  =  j^~3  approximately 
*  Bartlett,  "Square  Root  Transformation." 
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Hence,  equation  (18)  above  becomes  for  y  =  \/x 

V{v)  =k  +  ^  +  k\^-  +  ik)-  si2 approximate'y    (27) 


4n       32Mn2    '    64M2n3 
Therefore,  as  n  — >  <*>  for  fixed  yu 

y^  =  i  +  0@  <28> 


and  as  m  ~>  °°  f°r  fixed  n 


F^=i+°(y  <»> 


Consequently,  if  either  n  or  /z  is  large,  F(?/)  =  l/4n,  approximately. 

4.2.3.  Estimate  of  the  Variance  of  a  Normal  Population  Based  on  n 
Degrees  of  Freedom. — Let  6  =  a2,  the  variance  of  a  normal  population, 
and  let  T  =  s2,  an  estimate  of  <r2  such  that  ns2/a2  is  distributed  as  x2 
for  n  degrees  of  freedom.     It  is  not  difficult  to  show  that 

E(s2)  =  a2 
V(s2)        2*4 


n 

W  (30) 


M4(s2)  =  ia(»  +  4V 


rr 


Clearly  /x3  and  ju4  are  o(l/n)  as  n  — >•  oo  and  #(cr2)  =  (o-2)2.  Therefore,  the 
theorem  implies  that,  if  a  transformation  that  will  stabilize  the  variance 
exists,  it  will  be 

f(s2)  =     I      "Z    ;        "^  ;      \-C  =-  V2\oges+C  (31) 


J    V2s2         V2 


For  convenience,  we  may  take  C  =  0  and  consider  the  transformation 
y  =  loge  s2  =  2.30259  logio  s2.     Since 

■^(2/0  =  ~~£  approximately 

E(y")  =  —  i  approximately  (32) 

[#(?/")]2  =  -g  approximately 
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the  type  of  reasoning  employed  in  connection  with  the  preceding  example 
shows  that 

y(loge  s2)  =  -  +  o  I  -  J         as  n  — >  oo  for  fixed  a2  (33) 


71 


F(loge  s2)  = h  o  (  —  J         as  <t2  — >  oo  f  or  fixed  n 


(34) 


Consequently,  F(loge  s2)  =  2/ft,  approximately,  provided  either  ft  or  a2 
is  sufficiently  large. 

4.2.4.  Correlation  Coefficient  Computed  from  a  Sample  from  a  Normal 
Population. — Let  6  =  p,  the  product  moment  coefficient  of  correlation 
in  a  bivariate  normal  population,  and  let  T  =  r,  a  correlation  coefficient 
computed  from  a  sample  of  n  observations  drawn  independently  at 
random  from  the  population.  The  mean,  variance,  and  higher  moments 
of  r  are  exceedingly  complicated  functions  of  p  and  n,  but  when  n  is 
large  and  \p\  <C  1, 

E(r)  =  p  approximately 

(1  —  p2)2 
V(r)  =  approximately 

/  n  A\  (35) 


«a) 


A14W  =0l~ )         as  n  — »  00 

Consequently,   the  theorem   suggests  the   use   of  the  following  trans- 
formation to  stabilize  the  variance: 

Taking  (7  =  0,  this  yields 

z  =  I  loge  y^  =  tanh"1  r  (37) 

the  transformation  of  r  introduced  by  R.  A.  Fisher.  * 

4.3.  A  Method  of  Discovering  Bartletfs  Adjustment 

Let  the  values  of 

<p(p)  =  2  arcsin  -\/p  (38) 

*  Fisher,  "On  the  'Probable  Error'  of  a  Coefficient  of  Correlation  Deduced  from 
a  Small  Sample,"  Metron,  Vol.  1  (1921),  pp.  1-32. 
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where 

p  =  l        (x  =  0,  1,  2,  •  •  •  ,  n)  (39) 

be  plotted  on  the  arithmetic  scale  of  arithmetic-probability  paper  against 
the  cumulative  probability 


V 

£j  r\(n  —  r)\ 

r  =  0 


n*Pr(l-P)«-r  (40) 


plotted  on  the  probability  scale,  for  a  particular  value  of  n  and  a  selected 
value  of  P.  If  n  >  10  and  0.05  <  P  <  0.95,  it  will  be  found  that  the 
plotted  points  f or  2  <  x  <  n  —  2  lie  very  nearly  on  a  straight  line,  that 
the  points  for  x  =  1  and  x  =  n  —  1  depart  slightly  from  the  extension 
of  this  line,  but  that  the  points  for  x  —  0  and  x  =  n  lie  far  off  the  exten- 
sion of  this  line.  If  the  values  ^>(0)  =0  and  <p(l)  =  tt  are  replaced  by 
scores  <p(0)  =  an  and  <p(l)  =  bn  such  that  the  corresponding  plotted 
points  fall  on  a  gently  curving  extrapolation  of  the  curve. through  the 
plotted  points  for  2  <  x  <  n  —  2,  it  will  be  found  that 

an  =  2  arcsin  */j-  approximately  .  ,     . 

bn  =  7r  —  an  approximately 

The  foregoing  suggests  that  a  better  adjustment  than  Bartlett's 
could  be  obtained  by  extending  in  both  directions  the  line  of  best  fit 
through  the  points  for  2  <  x  <  n  —  2  and  using  this  line  to  define  the 
scores  for  x  =  0,  1,  n  —  1,  and  n.  This  was  tried  and  some  improve- 
ment was  noted,  but  for  n  >  10  and  0.05  <  P  <  0.95  the  improvement 
did  not  appear  to  be  worth  the  trouble. 

This  procedure  may  be  found  useful  for  obtaining  adjustments  of  the 
Bartlett  type  in  connection  with  other  transformations  to  stabilize  variance. 

5.  REFERENCES* 

6.1.  Inverse  Sine  Transformation 

Bartlett,  M.  S. :  "Square  Root  Transformation  in  Analysis  of  Variance,"  Supplement 
to  the  Journal  of  the  Royal  Statistical  Society,  Vol.  3  (1936),  pp.  68-78. 

*  As  this  volume  went  to  press,  binomial-probability  graph  paper  utilizing  the 
inverse  sine  transformation  was  put  out  by  the  Codex  Book  Company,  Inc.,  Norwood, 
Mass.  The  paper  was  designed  by  Frederick  Mosteller  and  John  W.  Tukey,  pursuant 
to  a  suggestion  published  in  1943  by  R.  A.  Fisher  and  K.  Mather  (Annals  of  Eugenics, 
Vol.  12,  p.  13).  Messrs.  Mosteller  and  Tukey  plan  to  write  an  explanation  of  its 
design  and  uses,  probably  for  the  Journal  of  the  American  Statistical  Association. 
In  the  meantime,  they  have  prepared  a  brief  statement  that  may  be  obtained  from 
the  Codex  Book  Company. 

The  paper  may  be  ordered  in  two  weights,  heavy  (no.  32,298)  and  light  (no. 
31,298).     The  sheets  measure  8^  by  11  in. 
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CHAPTER  17 
ELEMENTS  OF  SEQUENTIAL  ANALYSIS* 

1.  INTRODUCTION 

1.1.  Nature  of  Sequential  Tests 

A  sequential  test  of  a  statistical  hypothesis  consists  of  making  a 
certain  calculation  after  each  observation  (or  group  of  observations)  and 
deciding  on  the  basis  of  this  calculation  whether  (1)  to  accept  the  hypothe- 
sis under  test,  (2)  to  accept  the  alternative  to  that  hypothesis,  or  (3)  to 
suspend  judgment  pending  the  examination  of  more  data.  The  criteria 
for  these  decisions  are  such  that  the  hypothesis  is  accepted  or  rejected  as 
soon  as,  but  not  before,  the  data  are  sufficient  for  doing  so  with  prede- 
termined degrees  of  reliability. 

1.2.  Advantages  and  Disadvantages 

Sequential  is  superior  to  nonsequential  analysis  whenever  (1)  the 
data  become  available  serially  and  (2)  the  cost  of  the  data  (in  terms  of 
time,  labor,  material,  and  so  forth)  is  more  or  less  proportional  to  the 
amount  of  data.  Nonsequential  analysis  is  superior  whenever  (1)  the 
amount  of  data  is  fixed  or  (2)  the  cost  of  the  data  is  largely  overhead, 
hence  more  or  less  independent  of  the  amount  of  data.  Superiority  con- 
sists in  minimizing  a  set  of  quantities  (N,a,fi),  where  N  is  the  number  of 
observations,  a  is  the  risk  (that  is,  the  probability)  of  erroneously 
rejecting  the  hypothesis  under  test,  and  (3  is  the  risk  of  erroneously 
accepting  this  hypothesis.  Sequential  and  nonsequential  tests  differ  in 
the  constraints  under  which  this  set  is  minimized.  Nonsequential  tests 
treat  N  as  fixed  and  are  so  designed  that  with  either  risk  a  or  /3  fixed  the 
other  is  minimized.     Sequential  tests  treat  the  number  of  observations  N 

*  Sequential  analysis  was  developed  in  the  Statistical  Research  Group  in  1943, 
the  theory  principally  by  A.  Wald  and  the  techniques  of  application  by  numerous 
members  of  the  group,  including  Kenneth  J.  Arnold,  H.  A.  Freeman,  Milton  Friedman, 
M.  A.  Girshick,  Edward  Paulson,  A.  Wald,  and  W.  Allen  Wallis. 

This  chapter  is  an  extension  of  a  memorandum  by  the  author  of  the  chapter, 
"Sequential  Tests  of  Statistical  Significance,"  which  is  cited  as  "SRG  180"  in  Appen- 
dix B  of  Sequential  Analysis.  A  more  extensive  exposition  of  the  theory  than  is  given 
here  will  be  found  in  Appendix  B  of  Sequential  Analysis,  and  a  more  extensive  sum- 
mary of  methods  of  application  than  is  given  here  will  be  found  in  "Sequential  Tests." 
For  a  complete  and  rigorous  account  of  the  theory,  see  "Sequential  Tests";  for  a  full 
account  of  methods  of  application  and  interpretation,  including  computational  aids, 
see  Sequential  Analysis.      (For  full  citations,  see  Sec.  8.) 

419 
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as  a  variable  and  are  so  designed  that,  for  fixed  risks  a  and  0,  the  expected 
(that  is,  average)  number  of  trials  iV"  required  to  reach  a  decision  is 
minimized.  If  for  a  nonsequential  test  N  is  made  large  enough  so  that, 
with  a  fixed,  /3  will  not  exceed  a  predetermined  amount,  this  value  of  N 
will  exceed  (frequently  by  as  much  as  100  percent)  the  N  required  with 
a  sequential  test  for  the  same  a  and  13.  Thus,  when  N  is  readily  subject 
to  variation,  sequential  tests  are  superior;  when  N  is  not  readily  varied, 
nonsequential  tests  are  superior.* 

1.3.  General  Procedure 

The  basic  quantity  computed  after  each  observation  in  a  sequen- 
tial test  is  known  in  statistical  literature  as  the  " likelihood  ratio." 
The  probability  of  obtaining  the  observations  actually  at  hand  is  com- 
puted first  on  the  assumption  that  the  hypothesis  under  test  (designated 
the  "null  hypothesis")  is  true,  and  second  on  the  assumption  that  the 
alternative  to  this  hypothesis  (the  "alternative  hypothesis")  is  true.  If 
these  probabilities  are  designated  by  P0  and  Ph  respectively,  the  likeli- 
hood ratio  is  simply  Pi/Po.  In  a  sequential  test  it  is  computed  after  each 
observation  (or  group  of  observations),  each  computation  being  based 
on  all  the  observations  available.  If  the  likelihood  ratio  ever  reaches  a 
certain  level  A  (A  >  1),  the  collection  of  data  (experiment)  halts  and 
the  alternative  hypothesis  is  accepted.  If  the  likelihood  ratio  ever  falls 
to  a  certain  level  \/B  (B  >  1),  the  experiment  ceases  and  the  null 
hypothesis  is  accepted.  When  the  likelihood  ratio  is  between  A  and 
l/B,  the  evidence  is  insufficient  for  either  decision  with  the  required 
reliability.  The  required  reliability,  that  is,  the  a  and  #  of  the  preceding 
paragraph,  completely  determines  A  and  B;  in  fact,  A  =  (1  —  fi)/a  and 
B  =  (l-a)/(3. 

Since  in  most  cases  the  successive  observations  are  independent,  the 
likelihood  ratio  for  a  set  of  observations  is  simply  the  likelihood  ratio  for 
the  last  observation  multiplied  by  that  for  the  set  of  preceding  observa- 
tions. Consequently,  using  the  logarithm  of  the  likelihood  ratio  reduces 
the  arithmetic  to  addition  and  subtraction.  After  each  observation, 
simply  add  log  Pi  to  and  subtract  log  P0  from  the  algebraic  sum  of  such 
quantities  for  all  preceding  observations.     If  this  sum  ever  reaches 

a  =  log  A  =  log (1) 

a 

accept  the  alternative  hypothesis;  if  it  ever  falls  to  —b,  where 

*  A  detailed  discussion  of  the  conditions  under  which  sequential  analysis  is 
advantageous  and  those  under  which  it  is  disadvantageous  is  given  in  Statistical 
Research  Group,  Sampling  Inspection,  pp.  33-38  and  91-97.  Although  the  discussion 
there  is  in  terms  of  specific  circumstances,  its  generalization  is  obvious, 
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b  -  log  B  =  log  )-^  (2) 

accept  the  null  hypothesis.     Otherwise,  continue  the  experiment. 
1.4.  Simplifications 

The  preceding  paragraph  is  applicable  whatever  the  null  and  alterna- 
tive hypotheses  may  be,  provided  only  that  all  parameters  of  each 
hypothesis  are  uniquely  specified.  In  particular  problems,  the  procedure 
can  usually  be  simplified  by  expressing  log  P±  and  log  P0  directly  in 
terms  of  the  observations,  then  adjusting  the  termination  levels  to  simplify 
the  expression.  This  is  illustrated  below  for  three  common  tests:  (1)  for 
the  probability  of  success  in  a  binomial  distribution;  (2)  for  the  mean  in 
a  normal  distribution;  and  (3)  for  the  difference  in  probability  of  success 
between  two  binomial  distributions.  The  arithmetic  can  be  reduced  in 
the  first  and  third  cases  to  mere  counting  and  in  the  second  case  to  simple 
addition. 

2.  BINOMIAL  DISTRIBUTION 
2.1.  Algebraic  Simplification  of  General  Procedure 

Suppose  it  is  required  to  determine  whether  the  proportion  p  of  suc- 
cesses, from  a  certain  type  of  operation  that  results  in  either  success  or 
failure,  is  po.  The  alternative  against  which  this  is  to  be  tested  is  that 
p  =  Pi  (pi  >  Po).  The  value  of  p0  may  be  the  minimum  acceptable 
proportion  for  the  operation,  it  may  be  a  value  observed  from  long 
experience  with  similar  operations,  or  it  may  have  been  deduced  from 
theoretical  reasoning.  The  value  of  pi  may  be  the  least  excess  over  p0 
worth  detecting,  the  proportion  expected  on  theoretical  grounds,  and  so 
forth.  Suppose  that,  after  N  operations,  S  successes  have  occurred. 
The  probability  of  this  happening  (in  the  sequence  actually  observed)  is 
pf(l  —  pi)N~s  under  the  alternative  hypothesis  and  pg(l  —  po)N~s 
under  the  null  hypothesis.     The  likelihood  ratio  is  therefore 


(i  -  po 

(1  -  Po) 


N—S 


and  its  logarithm  is 

S.og(g)  +  (JV-S)log[([^-)] 
The  hypothesis  that  p  =  p0  is  to  be  rejected,  and  the  hypothesis  that 
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p  =  pi  accepted,  if  this  is  ever  as  great  as  a,  that  is,  if  S  ever  reaches 

a  +  N  log  fc-2?) 

log  r  ~ Po  1 

This  is  simply  a  linear  function  of  JV  with  positive  intercept.     Similarly, 
the  hypothesis  that  p  =  po  is  to  be  accepted  if  S  is  ever  as  small  as 

-b  +  N  log  (j— 1*) 

8  Lp»(1   "  Pi)  J 
another  (and  parallel)  linear  function  of  iV  with  negative  intercept. 

2.2.  Graphical  Procedure 

Thus,  the  simplest  procedure  is  to  prepare  a  chart  with  N  on  the 
horizontal  and  S  on  the  vertical  axis.  On  this  chart  the  two  parallel 
lines  Si  and  So  are  drawn.  After  each  observation  is  obtained,  the  total 
number  of  successes  observed  is  plotted  against  the  total  number  of 
observations  made.  If,  for  any  N,  S  is  as  great  as  Si,  discontinue  the 
experiment  and  accept  the  hypothesis  that  p  =  pi.  If,  for  any  JV,  S  is 
as  small  as  $o,  discontinue  the  experiment  and  accept  the  hypothesis 
that  p  =  po.  When  S  is  between  So  and  Si,  no  decision  is  possible  with 
a  risk  as  small  as  a  or  /3.  A  variation  of  this  procedure  is  to  plot  the 
number  of  failures  F  (=  N  —  S)  on  the  horizontal  axis  instead  of  N, 
retaining  S  for  the  vertical  scale.  Then  the  two  decision  lines  become, 
respectively, 

a  +  F  log  ({— P?) 

s,  =  V^pJ  (5) 

log?-1 

Po 

and 

S;  = V^pJ  (6) 

log^ 
Po 

Caution:  Be  sure  that  the  same  logarithmic  base  is  used  for  a  and  6 
as  is  used  elsewhere  in  Si  and  S2. 
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3.  NORMAL  DISTRIBUTION 

3.1.  Algebraic  Simplification  of  General  Procedure 

Suppose  the  standard  deviation  of  a  normal  (Gaussian)  distribution 
is  known  to  be  a  and  that  it  is  required  to  test  the  null  hypothesis  that  the 
mean  m  is  m0  against  the  alternative  hypothesis  that  m  =  mi  (mi  >  m0). 
If  N  observations,  xh  X2,  ...  ,  xN,  are  obtained,  the  probability  of  this 
sequence  is 


iV 

(r  V^)-»  exP  {  -  ^  [X  {Xi  ~  mo)  J I 

t  =  1 


(7) 


under  the  null  hypothesis  and  the  same  expression,  except  with  mi 
replacing  m0,  under  the  alternative  hypothesis.  From  these  probabilities 
the  natural  logarithm  of  the  likelihood  ratio  is  found  to  be 


iV 


mi  —  m0  /  V  m"  +  mi  aA  fo\ 

—?—  \lfXi 2T~  N)  (8) 


i  =  i 

The  hypothesis  that  m  =  mx  is  to  be  accepted  if  this  is  ever  as  great  as  a, 
that  is,  if  2#  is  ever  as  great  as 

mi  —  m0  2 

Similarly,  the  hypothesis  m  =  mo  is  to  be  accepted  if  *Lx  is  ever  as  small 
as 

M         ^±l_       m1+moN  ( 

mi  —  m0  2 

3.2.  Graphical  Procedure 

As  in  the  case  of  the  binomial,  prepare  a  chart  with  N  on  the  hori- 
zontal axis  and  Sx  on  the  vertical  axis,  and  draw  on  it  the  two  parallel 
straight  lines  Mi  and  Mo,  of  which  Mi  has  a  positive  and  M 0  a  negative 
intercept.  As  each  observation  is  obtained,  add  it  to  the  sum  of  all 
previous  observations  and  plot  the  total  against  N.  If  the  plotted  point 
is  ever  as  high  as  Mi,  terminate  the  experiment  and  accept  the  hypothesis 
that  m  =  mi.  If  it  is  ever  as  low  as  Mo,  terminate  the  experiment  and 
accept  the  hypothesis  that  m  =  m0.  When  it  is  between  M0  and  Mh  no 
decision  can  be  reached  without  a  risk  of  error  exceeding  a  or  /?. 

Caution:  Be  sure  that  a  and  b  are  natural  logarithms.  (Natural 
logarithms  are  2.3  times  the  corresponding  common  logarithms.) 
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4.  DOUBLE  DICHOTOMIES 

4.1.  Null  and  Alternative  Hypotheses 

Data  from  two  different  operations,  each  of  which  results  in  either 
success  or  failure,  constitute  a  double  dichotomy.  To  determine  sequen- 
tially whether  the  proportions  of  successes  are  the  same  for  the  two 
operations  (which  is  logically  identical  with  testing  whether  the  two 
operations  are  represented  in  the  same  proportion  among  the  successes 
as  among  the  failures),  set  up  the  null  hypothesis  that  the  ratio  of  suc- 
cesses to  failures  is  the  same  for  the  two  operations,  and  the  alternative 
hypothesis  that  the  ratio  of  successes  to  failures  is,  u  (u  >  1)  times  as 
great  for  the  second  operation  as  for  the  first.  Thus,  if  pi  and  p2  are 
the  proportions  of  successes  for  the  two  operations,  the  null  hypothesis 

is  that       ,,,, *-U  =  1,  which  is  equivalent  to  pi  =  p2.     The  alterna- 

Pi/(1  -  Vi) 

tive  hypothesis  is  that      ,,., ^l  =  u,  which  is  equivalent  to  the  relation 

Pi/(1  -  Pi) 
pi  =  upi/[l  +  pi(u  —  1)].  Then  collect  observations  in  pairs,  each  pair 
containing  one  observation  for  each  operation.  Those  pairs  in  which  both 
operations  succeed  or  both  fail  are  disregarded,  since  they  throw  no  light 
on  the  question  of  which  operation  yields  the  higher  proportion  of  suc- 
cesses. Only  the  residual  pairs,  in  which  one  and  only  one  operation 
results  in  a  success,  are  considered  in  the  remainder  of  this  section;  these 
residual  pairs  will  hereafter  be  called  successes  if  the  operation  expected 
under  the  alternative  hypothesis  to  be  superior  produces  a  success. 

4.2.  Algebraic  Simplification  of  General  Procedure 

Under  the  null  hypothesis,  the  proportion  of  such  successes  among  the 
residual  pairs  will  be  J^.  Under  the  alternative  hypothesis,  the  propor- 
tion of  such  successes  will  be  u/(l  +  u).  Thus,  the  problem  reduces 
to  that  of  the  binomial  distribution  discussed  in  Sec.  2.  If  S  is  the 
number  of  successes  and  if  F  is  the  number  of  failures  from  N  residual 
pairs  (S  +  F  =  N),  the  logarithm  of  the  likelihood  ratio  may  be 
written 


Mog_^_+flogr|_  (11) 


The  alternative  hypothesis  (that  the  second  operation  is  superior)  is 
to  be  accepted  if  this  is  ever  as  great  as  a;  this  will  occur  if,  for  any  F,  S 
is  as  much  as 
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a  +  Flog  —^— 

logYTu 

a  linear  function  of  F  with  positive  intercept.  Similarly,  the  null 
hypothesis  (that  the  two  operations  are  equally  good)  is  to  be  accepted 
if  the  logarithm  of  the  likelihood  ratio  is  ever  as  small  as  —  b,  which  will 
occur  if,  for  any  F,  S  is  as  small  as 

S»  =  — Tu (13) 

a  linear  function  of  F,  parallel  to  Si  but  with  negative  intercept. 

4.3.  Graphical  Procedure 

Thus,  the  simplest  procedure  is  to  prepare  a  chart  on  which  the  hori- 
zontal axis  represents  F,  the  number  of  residual  pairs  in  which  a  failure 
has  occurred  with  the  operation  expected  under  the  alternative  hypothe- 
sis to  be  superior,  and  the  vertical  axis  represents  S,  the  number  of  residual 
pairs  in  which  that  operation  has  succeeded.  Draw  on  this  chart  the 
two  parallel  straight  lines  So  and  Si.  After  each  residual  pair  is  observed, 
plot  a  point  whose  coordinates  represent  the  total  number  of  successes 
and  failures  among  all  residual  pairs  obtained  so  far.  If,  for  any  value 
of  F,  S  is  as  great  as  Si,  halt  the  experiment  and  conclude  that  the  second 
operation  is  superior.  If,  for  any  value  of  F,  S  is  as  little  as  So,  halt  the 
experiment  and  conclude  that  the  two  operations  are  equally  good. 
When  S  is  between  So  and  Si,  no  conclusion  can  be  reached  without  a 
risk  of  error  in  excess  of  a  or  j8.  An  alternative  procedure  is  to  let  the 
horizontal  scale  represent  the  number  of  residual  pairs  N  (  =  S  +  F),  in 
which  case  the  decision  lines  become,  respectively, 

a  +  N  log  ±±* 

SI  =  j —  (14) 

log  u 

and 

-6  +  iVlog^+Jf 

S'0  =  i —  (15) 

log  u. 

Caution:  Be  sure  that  the  same  logarithmic  base  is  used  for  a  and  b 
as  is  used  elsewhere  in  So  and  Si. 
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5.  OTHER  APPLICATIONS 

5.1.  Simple  Problems 

Obviously,  the  general  procedure  and  the  simplifications  described 
in  Sees.  1.3  and  1.4  can  be  applied  to  a  wide  range  of  problems.  The 
application  is  straightforward  whenever  the  problem  is  simple,  in  that  a 
single  parameter  0  is  under  investigation  and  the  purpose  of  the  analysis 
is  to  discriminate  between  a  null  hypothesis  that  this  parameter  lies 
below  above  some  level  0o  and  an  alternative  hypothesis  that  it  lies 
above  below  another  level  0i,  where  0i  >  0O  [0i  <  0<J.  For  example, 
procedures  for  testing  the  mean  of  a  Poisson  distribution  and  for  testing 
the  standard  deviation  of  a  normal  distribution  are  easily  derived  from 
the  general  procedure.  More  complicated  problems  can  sometimes  be 
reduced  to  a  simple  form  for  which  the  procedure  is  straightforward,  as  is 
illustrated  by  the  case  of  double  dichotomies. 

5.2.  Complex  Problems 

Sequential  procedures  have  also  been  worked  out  successfully  for 
certain  more  complicated  problems.  Important  examples  are  (1)  test- 
ing the  null  hypothesis  that  the  mean  of  a  normal  distribution  has  a 
certain  value  m0  against  the  two-sided  alternative  hypothesis  that  the 
mean  is  either  greater  than  mi  or  less  than  2m0  —  m\  (mi  >  m0) ;  (2)  testing 
the  same  null  and  alternative  hypotheses  as  in  Sec.  3,  but  with  no  infor- 
mation about  the  standard  deviation  other  than  that  given  by  the  sequen- 
tial sample  (the  sequential  equivalent  of  Student's  test) ;  (3)  testing  the 
standard  deviation  of  a  normal  distribution,  with  no  information  about 
the  mean  other  than  that  given  by  the  sequential  sample ;  (4)  testing  the 
fraction  of  a  normal  distribution  exceeding  a  certain  level,  with  no 
information  about  the  mean  or  standard  deviation  other  than  that 
given  by  the  sequential  sample. 

6.  OPERATING  CHARACTERISTICS 
6.1.  General 

Sequential  tests  are  determined  by  four  quantities,  the  values  0o 
and  0i  specified  by  the  null  and  alternative  hypotheses  for  the  parameter 
being  investigated  and  the  values  a  and  /3  of  the  two  risks  of  error.  For 
simple  problems  involving  independent  observations,  for  example,  those 
described  in  Sees.  2,  3,  and  4,  the  sequential-test  procedure  involves  only 
three  parameters,  the  slope  s  and  the  two  intercepts  —hi  and  ~hi  of  the 
two  decision  lines.  Obviously,  then,  many  combinations  of  0O,  0i,  a,  and 
/3  lead  to  the  same  sequential  test.  By  finding  these  combinations,  we 
can  determine  the  probability  that  the  test  will  lead  to  acceptance  of 
the  null  hypothesis  for  any  possible  value  of  0,  not  merely  0O  and  0i. 
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The  function  (curve)  giving  the  probability  of  acceptance  Le  for  every 
possible  value  of  0  is  known  as  the  operating-characteristic  function 
(OC  curve)  of  the  test  procedure;  and  1  —  Le  is  known  as  the  power 
function. 

6.2.  Five-point  OC  Curves 

Although  the  equations  of  these  curves  are  known,  it  ordinarily 
suffices  to  find  five  easily  determined  points  on  them.  Two  points  are 
known  from  the  fact  that  at  the  extreme  values  of  0,  for  example,  at 
p  =  0  and  p  =  1  in  the  case  of  the  binomial  distribution,  the  probabilities 
of  acceptance  are  1  and  0.  Two  more  points  are  known  from  the  fact 
that  the  test  was  obtained  in  such  a  way  that  at  0O  and  0i  the  probabilities 
of  acceptance  are  1  —  a  and  /?.  A  fifth,  and  intermediate,  point  results 
from  the  fact  that  the  probability  of  accepting  the  null  hypothesis  is 


hi  +  h2 

(which  is  Yl  if  a  =  /3)  when  0  is  such  that  the  expected  value  of  each  term 
in  the  cumulated  sum  is  s,  for  example,  when  the  parameter  of  a  binomial 
distribution  is  p  =  s,  when  the  mean  of  a  normal  distribution  is  m  =  s, 
or  when  the  standard  deviation  of  a  normal  distribution  is  a  =  \/s. 

7.  AVERAGE  NUMBER  OF  OBSERVATIONS 
7.1.  General 

The  average  number  of  observations  N  necessary  to  reach  a  decision 
will  depend  upon  the  risks  of  error  and  upon  the  divergence  between 
the  null  and  alternative  hypotheses.  N  will  be  greater  the  smaller  a, 
the  smaller  |3,  or  the  smaller  the  divergence  between  the  null  and  alterna- 
tive hypotheses.  Furthermore,  it  will  have  one  value,  No,  when  the  null 
hypothesis  is  true  and  another,  Ni,  when  the  alternative  hypothesis  is 
true;  in  general  its  value  Ne  will  depend  upon  the  true  value  0  of  the 
parameter  being  tested.  The  function  (curve)  giving  the  average 
number  of  observations  Ne  for  every  possible  value  of  0  is  known  as  the 
average  sample  number  function  (ASN  curve).  Its  equation  may  be 
written 

N,  =  (1  ~  U)a  ~  Uh  (16) 

Ze 

where  Le  is  the  probability,  when  the  true  value  of  the  parameter  is  6, 
that  the  test  will  result  in  accepting  the  null  hypothesis,  and  ze  represents 
the  average  value  of  the  logarithm  of  the  likelihood  ratio  when  the  true 
value  of  the  parameter  is  6.  In  the  case  of  the  binomial  distribution, 
for  example,  the  likelihood  ratio  is  either  pi/po  (when  a  success  occurs), 
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or  (1  —  pi)/(l  —  po)  (when  a  failure  occurs).     If  the  true  proportion  of 
successes  is  p, 

h  =  V  log  g  +  (!->)  log  (l^|l)  (17) 


i  i  -  pi  i  i  r^^1  ~  po)i 

=  log  : — h  V  log     £— h £— r 

Si  -  Po  K  LPo(l  -  Pi)  J 


7.2.  Five-point  ASN  Curves 

Five  points  on  the  ASN  curve  are  readily  obtainable  from  the  five 
known  points  on  the  OC  curve.  Two  points  are  obtained  from  the 
extreme  values  of  0  and  the  corresponding  probabilities  of  acceptance, 
1  and  0.  Two  more  points,  corresponding  with  0O  and  61}  are  obtained 
by  taking  Le  equal  to  1  —  a  and  /3,  respectively.  The  fifth,  intermediate, 
point  is  obtained  by  multiplying  together  hi  and  h2  and  dividing  the 
product  by  the  variance  of  a  term  in  the  cumulated  sum  when  0  is  such 
that  the  expected  value  of  each  term  is  s;  for  example,  the  divisor  of 
/ii/i2  in  testing  the  binomial  proportion  is  s(l  —  s),  in  testing  the  mean 
of  a  Poisson  distribution  it  is  s,  in  testing  the  mean  of  a  normal  distribu- 
tion of  known  standard  deviation  a  it  is  a2,  in  testing  the  standard 
deviation  of  a  normal  distribution  it  is  2s2.  This  fifth  value,  Ns,  is  usually 
(but  not  necessarily)  near  the  maximum  of  the  ASN  curve. 

For  a  given  value  of  0,  sampling  fluctuations  will  cause  N  to  vary 
from  one  sequential  analysis  to  another,  using  the  same  values  of  hi,  h%, 
and  s.  The  distribution  of  N  is  skewed  in  such  a  way  that  most  samples 
require  fewer  than  N9  observations;  the  variability  is  such  that  the 
sample  size  rarely  exceeds  3No.  For  usual  combinations  of  0O,  0i,  a,  and 
j3,  the  ASN  curve  falls  off  sharply  from  its  peak  and  No  and  Ni  are 
substantially  less  than  Ns;  this  is  not  necessarily  the  case,  however, 
and  in  fact  it  is  possible  for  the  maximum  of  the  ASN  curve  to  occur 
outside  the  interval  0O  to  0i,  although  0  =  s  is  always  between  0O  and  0i. 
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by     sequential-sampling     procedures,     232- 

233,  422,  423,  425 
by  sign  of  difference  between  sample  means, 

377 
by  single  one-sided  criterion  based  on  x  and  s, 
36-41,  67-71 
figures  showing,  38-40 
by    single    two-sided    criterion    based    on    x 
and  s,  52,  54-56,  57 
figures  showing,  38-40,  55,  357 
for     "immediate     acceptance,"     in     variables 
inspection   for   fraction   defective,   34,   37, 
41,  67,  69-70 
Addition  theorem,  for  x2>  153 
Adjusted  data,  use  of,  in  variables  inspection  for 

fraction  defective,  36-41,  54,  69-71 
Adjusted  value,  of  chi  (Xc),  254-255,  257 
general  procedure  for  obtaining,  254n. 
of  chi-squared  (xc2),  254,  255 

general    procedure    for    obtaining,    reference 
giving,  254n. 
of  phi  (<pB),  formula  for,  401 
table  of,  406 
(See  also  Adjustment) 
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Adjustment,  Bartlett's,  399 

heuristic  derivation  of,  413-414 
table  to  facilitate,  406 
of  chi  (x),  when  calculated  from  2X2  table, 
254,  255,  257 
general  procedure  for,  254n. 
of    chi-squared,    when    calculated    from   2X2 
table,  254-255 
general  procedure  for,  reference  giving,  254n. 
constant     (A),     in     graphical     procedure      for 
variables  inspection  for  fraction  defective, 
36,  37,  69-71 
optimum  choice  of,  41,  54,  70,  71 
of   inverse-sine   transformation,   399,   405-407, 
407n. 
heuristic  derivation  of  Bartlett's,  413,  414 
table  to  facilitate  Bartlett's,  406 
of  k,  when  N  is  fixed,  19,  20 
of  N,  by  modifying  OC  curve,  19 
of  rejection  line,  to  allow  for  fixed  acceptance 
line,  358,  359 
example,  359-361 
of  s,  to  eliminate  bias,  58 

of    starting    point,    after    first    two    rounds    of 
sequential  procedure  for  locating  a  maxi- 
mum, 368,  369 
Yates',  254,  255,  257 
Air  testing  of  bombsights,  113-184 

arrangement  of  bombings  for,  121,  122 
function  of,  117 

multivariate  quality  control  of,  113-184 
problems  of,  116 

proposed  sampling  plan  for,  119-121 
a:  probability  of  erroneously  rejecting  hypothesis 
under  test,  271,  282,  283,  419 
taken  equal  to  0.50,  283 

use  in  determining  sequential  test  procedures, 
420,  421 
(See  also  Error,  of  the  first  kind) 
a:  probability  that  lot  of  acceptable  quality  will 
be  rejected,  16 
(See  also  Producer's  risk) 
a:  true  intercept  of  linear  regression  equations, 
322 
estimation  of,  when  cost  of  each  measurement 
depends  upon  its  magnitude,  322-328 
a-level  of  significance,  270,  278-279,  281 
Alternative   hypothesis,  in  analysis  of  variance, 
311-313 
for  which  probability  of  error  of  second  kind 
is  /3,  61,  231,  260n.,  263,  264,  271,  279,  313, 
380,  421,  423,  424 
in  tests  of  means,  61,  377,  379-380,  386,  423 
of  proportions  (percentages),  260n.,  263,  264, 

357-358,  421,  424 
of  standard  deviations  (variances),  231,  270, 
271,  278,  281,  311-313,  387 
American  Society  for  Testing  Materials,  58n.,  93 
American  Standards  Association,  58n.,  93 
Amount    of    information,    about    dependence    of 
one  variable  on  another,  321 
maximizing,  when  cost  of  each  measurement 
depends  upon  its  magnitude,  319-328 


Amount    of    information,    about    a    function    in 
neighborhood  of  maximum,  366 
provided  by  factorial  experiment,  366,  367, 

370,  371 
provided  by  sequential  experiment,  368,  369, 
370,  371 
about  population  mean  (ju),  provided  by  probit 
data,  345,  346 
maximizing,  349-352 
provided  by  sample  mean  (x),  161 
provided  by  sample  median,  161 
about     population     standard     deviation      (a), 
provided  by  probit  data,  345,  346 
maximizing  of,  349-352 
provided  by  sample  mean  deviation,  162,  163 
provided  by  sample  standard  deviation  (s), 
162,  229 
about    quality    of   lot,    provided    by    sampling 
inspection,  156-158 
when  inspection  is  by  attributes,  7 
when  inspection  is  by  variables,  7 
(See  also  Amount  of  inspection) 
Amount    of    inspection,     14,     15,     17-21,    26-28, 
60-66,   8*1,   82,   85,    100-109,   231,   233.    234, 
235-244,  259,  260,  263,  264,  274,  275,  278, 
280,   282,   283,   311-314,   319-328,   331-338, 
358,  362n.,  380,  381,  419,  420,  427,  428 
with  attributes  plans,   14,  15,  26-28,  237-244, 
259,  260n.,  263,  264,  427,  428 
comparison    of    double,     multiple,     and    se- 
quential sampling,  237-244 
figures  showing,  240-243 
contrasted  with  variables  plans,  26-28 
formula    for    determining,    with    sequential 
sampling,  427,  428 
for  single  sampling,  259,  260n.,  263,  264 
nomogram      for      determining,      for      single 
sampling,  260 
method  of  using,  260n. 
when   severity   of   any   one   test   cannot    be 
controlled  precisely,  331-338 
average,  14,  15,  26,  85,  233,  234,  235-244,  358, 
362ra.,  419,  420,  427,  428 
for     comparison     of     two     proportions,     by 

sequential  sampling,  424,  427,  428 
with     curtailed     (truncated)     sampling,     se- 
quential, 362n. 
single,  14,  15 
for      double      dichotomies,      by      sequential 

sampling,  424,  427,  428 
with  double  sampling,  compared  to  multiple 
and  sequential,  237-244 
figures  showing,  240,  242 
with  multiple  sampling,  compared  to  double 
and  sequential,  237-244 
figures  showing,  240,  242 
with    sequential    analysis,    contrasted    with 
non-sequential   sampling,    26,   85,    235- 
244,  419,  420 
general  formula  for   one-parameter  tests, 
427 
with  sequential  attributes  plans,  26,  235-244, 
427,  428 
when  acceptance  line  is  omitted,  358 
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Amount  of  inspection,  average,  with  sequential 

attributes     plans,      contrasted      with 

double-      and      multiple-      sampling 

attributes  plans,  237-244 

figures  showing,  240,  242 

contrasted    with    single-sample    attributes 

plans,  26 
formula  for,  427,  428 
with   sequential   one-decision   binomial   test, 

358,  427,  428 
with     sequential     one-sided     binomial     test, 
formulas  for,  427,  428 
figures  showing,  240,  242 
with  sequential  one-sided  test  for  a,  when  y. 
is  unknown,  formula  for,  233 
figures  showing,  234 
with    sequential    variables    inspection    plan 
for  fraction  defective,  85 
(See  also  ASN  curve) 
when  cost  of  each  measurement  depends  upon 

its  magnitude,  319-328 
in  curtailed  (truncated)  inspection,  14,  15,  362n. 
when  decisions  are  based  on  signs  of  differences 

between  sample  means,  formula  for,  380 
determination  of,  nomogram  for,  260 

procedures    for,     17-21,    27n.,    28n.,    64-66, 
81,  82,  231,  274,  275,  278,  280,  282,  283, 
311-314,  331-338,  362n.,  427,  428 
for  analysis-of- variance  tests,  311-314 
for    x2-tests    of    variance    estimates,    231, 

274,  275,  278 
for  curtailed   (truncated)   sequential  sam- 
pling, 362n. 
for  F-tests  of  variance  estimates,  280,  282, 

283,  296,  311-314 
for  one-decision  sequential  binomial   test, 

358,  427,  428 
when  severity  of  any  one  test  cannot  be 

controlled  precisely,  331-338 
for    standard    deviation    tests,    274,    275, 

278,  280,  282,  283 
for  variables  inspection  plans  for  fraction 
defective,  17-21,  27n.,  28n.,  64-66,  81-82 
differences     in,     attributes     versus     variables 
inspection,  26-28 
with      double,      multiple,      and      sequential 
sampling,  237-244 
figures  showing,  240-243 
sequential    versus    non-sequential    sampling, 
26,  85,  237-244,  419,  420 
figures  showing,  233,  240-243 
distribution  of,  in  repeated  sequential  sampling, 

428 
with   double  sampling,   compared   to   multiple 
and  sequential,  237-244 
figures  showing,  240-243 
figures    showing    dependence    on    quality    sub- 
mitted, 234,  240-243 
fixed  in  advance,  adjustment  of  k,  19,  20 
formulas  for,   17,  27n.,  28n.,  60-63,  231,  233, 
259,  260,  263-264,  278n.,  427,  428 
in     attributes    inspection,     with     sequential 
sampling,  427,  428 
with  single  sampling,  259,  260,  263,  264 


Amount  of  inspection,  formulas  for,  for  binomial 
test,    one-sided,    with   sequential    sam- 
pling, 427,  428 
with  single  sampling,  259,  260n.,  263,  264 
when  decisions  are  based  on  signs  of  differ- 
ences between  sample  means,  380 
for  difference  of  means  test,  single  sample, 
when  a  is  known,  61,  62 
when  a  is  unknown,  17,  27n.,  28n.,  60-66, 
81,  82 
for   difference    of   proportions    (percentages) 
test,  sequential  sampling,  424,  427,  428 
single  sample,  259,  260,  263,  264 
incorrect  formula  for,  258 
for  double-dichotomy  homogeneity  test,  259, 

260,  263,  264 
for  fraction  defective  test  based  on  variables, 
single  sample,  17 
when  n  is   known   and  a   unknown,   27n., 

28n.,  231,  278n. 
when  fj.  is   unknown  and  a  known,   27??., 

28n.,  62 
when  /i  and  a  are  both  unknown,  17,  27n., 
28n.,  60-63 
for  general   one-parameter   tests,   sequential 
sampling,  427 
single  sampling,  61,  62 
for  mean  test,  single  sample,  with  a  known, 
27n.,  28n.,  62 
with   a   unknown,    17,    27n.,    28n.,    60-66, 
81,  82 
for  proportion  test,  with  sequential  sampling, 
427,  428 
with  single  sample,  259,  260n.,  263,  264 
when  range  of  samples  of  k  determines  lot 
quality,  with  sequential  sampling,  233 
with  single  sample,  231,  278n. 
for  setting  tolerance  limits,  101,  108,  109 
for  standard  deviation  test,  with  sequential 
sampling,  233 
with  single  sample,  231,  278n. 
for    variables   inspection    plans    for    fraction 
defective,  with  single  sample,  17 
when  n  is  known  and  <r  is  unknown,  27n., 

28n.,  231,  278n. 
when  fi  is  unknown  and  a  is  known,  27n., 

28«.,  62 
when  n  and  a  are  both  unknown,  17,  27n., 
28n.,  60-63 
minimum    acceptable,    when    severity    of    any 
one   test   cannot   be    controlled    precisely, 
331-338 
minimum   proposed,    when    data    are    coarsely 

grouped  or  rounded,  213,  214 
with   multiple  sampling,   compared   to   double 
and  sequential,  237-244 
figures  showing,  240-243 
nomogram  for  determining,  260 

uses  of,  and  procedures,  260 
normal,  45 
reduced,  45 

reduction   in,   by   efficient   utilization   of  data, 
160-163 
by  knowing  /*  or  <r,  27,  28 
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Amount    of    inspection,    reduction    in,    by   using 
double  sampling,  237-244 
figures  showing,  240,  242 
by  using  efficient  formulas,  160-163 
by  using  multiple  sampling,  237-244 

figures  showing,  240,  242 
by  using  sequential  sampling,  26-28,  85,  233, 
237-244,  369-371,  419,  420 
figures  showing,  233,  240,  242 
by  using  variables  inspection,  26-28 
with  sequential  sampling,  26,  26n.,  85,  233-244, 
362n.,  419,  420,  427,  428 
when  acceptance  line  is  omitted,  358,  358n. 
in  attributes  inspection,  26,  235-244,  427,  428 
contrasted     with     double     and     multiple 
sampling,  237-244 
figures  showing,  240-243 
contrasted  with  single  sampling,  26 
contrasted     with    non-sequential    sampling, 

26-28,  85,  235-244,  419,  420 
with  curtailed  sampling,  362n. 
formula  for,  general  one-parameter  tests,  427 
one-sided  binomial  test,  427,  428 
one-sided  test  for  a,  when  ju.  is  unknown,  233 
procedures   for   determining,   curtailed   sam- 
pling, 362n. 
general,  427 

one-decision  binomial  test,  358,  427,  428 
in  variables  inspection  for  fraction  defective, 
85 
for  setting  tolerance  limits,  100-108 
with  single  sample,  14,  15,  17-21,  26-28,  60-66, 
81,  82,  85,  231,  259,  260,  263,  264,  274,  275, 
278,  280,  282,  283,  311-314,  331-338,  380, 
381 
analysis-of-variance  tests,  311-314 
attributes    plans,    14,    15,    26-28,    259,    260, 
263,  264 
contrasted    with  sequential   sampling,    26, 

419,  420 
contrasted      with      variables      plans      for 
fraction  defective,  26-28 
binomial   tests,    one-sided,   259,    260,    260n., 

263,  264 
x2-tests  of  standard  deviation  (or  variance), 

231,  274,  275,  278 
contrasted  with  sequential  sampling,  26,  85, 

419,  420 
curtailed,  14,  15 

when  decisions  are  based  on  signs  of  differ- 
ences between  sample  means,  380 
difference  of  means  test,   when  <r  is  known, 

61,  62 
difference    of    proportions    (or    percentages) 
test,  formula,  259,  260,  263,  264 
nomogram  for,  260 
double-dichotomy    homogeneity    tests,    259, 

260,  263,  264 
for     F-tests     of     standard     deviations      (or 

variances),  280,  282,  283,  311-314 
general  formula  for  one-parameter  tests,  61- 

62 
for  mean  test,  <r  known,  2.7,  28,  62 

a  unknown,  17,  27n.,  28rc.,  60-66,  81,  82 


Amount    of    inspection,    with   single   sample,   for 
proportion  test,  259,  260rc.,  263,  264 
when  range  in  samples  of  k  determines  lot 

quality,  231,  278rc. 
for  standard  deviation  tests,  231,  274,  275, 

278,  280,  282,  283 
for    variables    inspection    plans    for   fraction 
defective,  17-28,  60-66,  81,  82 
when  n  is   known   and  a  unknown,   27n., 

28n.,  231,  274,  275,  278 
when  ix  is  unknown  and  <r  is  known,  27,  28, 

62 
when  fj.  and  a  are  both  unknown,   17-21, 
26-28,  60-66,  81,  82 
standard  deviation  of,  238,  241,  243 

computational  methods  for  determining,  244 
figures  showing,  241,  243 
tables  showing,   for  analysis-of-variance  tests, 
313n. 
for  attributes  plans,  26 
for  setting  tolerance  limits,  102-107 
for  standard  deviation  tests,  272,  273,  276, 

277,  284-295,  298-309 
for  variables  inspection  for  fraction  defective, 
22-27,  65 
with  truncated  sampling,  sequential,  362n. 

single,  14,  15,  238 
variability  of,  235-244 

computational  methods  for  determining,  244 
figures  showing,  241,  243 

nature  of,  in  repeated  sequential  sampling,  428 
Analysis  of  consumption  requirements,  47-49 
Analysis   of  experimental  error   (see  Analysis   of 

variance) 
Analysis  of  production  possibilities,  46,  47 
Analysis  of  variance,  297,  310-314,  385,  386 
algebraic  expression  for  coefficient  of  between- 
group  variance,  312 
for  degrees  of  freedom  in,  312 
for  mean  squares  in,  312 
expected  values  of,  312 
for  sums  of  squares  in,  312 
of  binomial  data,  398,  407,  408 
components  of  random  variation  in,  297,  310,  311 

resolution  by,  311-314 
components  of  systematic  variation,  310,  311 
components  of  variance,  resolution  by,  311-314 
of  percentage  data,  398,  407-408 
planning  experiments  leading  to,  297,  310-314 
skeleton,  312 
systematic   versus   random   variation   in,    297, 

310-311 
table,  312 

tests  of  significance  in,  312-314,  385,  386 
for  components  of  variance,  312-314 
for  differences  among  subgroup  means,  385, 

386 
(See  also  Standard  deviation;  Variance) 
Angular  difference,  between  two  binomial  prob- 
abilities, 262,  264 
between  two  observed  proportions,  264 
Angular  transformation,  257-259,  261-264,   336. 
395-416 
(See  also  Inverse-sine  transformation) 
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Animal  experiments,  311,  341,  342,  343n.,  352 
Applied  Mathematics  Panel,  iii,  vii,  x 
Summary  Technical  Report  of,  vii 
Approximations,   involved  in   formulas   for   vari- 
ables  inspection   for   fraction   defective,    21, 
64-66 
effects  of,  21,  49,  50,  62-64 

table  showing,  65 
use  of  non-central  t  to  avoid,  64-66 
AQL  (see  Acceptable-quality  level) 
arcsin  x,  reference  to  table  of,  260n.,  265,  402n.,  416 
Arcsine   transformation,    257-259,   261-264,   336 
395-416 
(See  also  Inverse-sine  transformation) 
Arithmetic  mean,  of  sample  measurements,  161 
as  estimator  of  population  (lot)  mean,  12,  161, 

162,  192,  196-198,  200,  203,  212,  213 
formula  for,  12,  100,  101,  161,  192 

when      measurements      are      grouped       (or 
rounded),  196 
symbol  for  (x),  12,  161,  192 

when  measurements  are  grouped  or  rounded 

(xr),  196 
(See  also  Sample  mean) 
Arithmetic-probability  paper,  414 
Army,  viii 

Arnold,  Kenneth  J.,  ix,  x,  xii,  6,  354,  419 
ASN  (see  Average  sample  number) 
Assignable  causes,  269 

that  introduce  random   components   of  varia- 
tion, 297,  310,  311 
systematic  components  of  variation,  310,  311 
Asymptotic  expansion,  for  confidence  limits  for 
population  proportion,  336 
for  percentage  points  of  ^-distribution,  148 
for  percentage  points  of  ^-distribution,  155 
for  tolerance-limit  factor  K,  101 
Attributes,    inspection    by,    7-9,    14,    15,    26-28, 
229n.,  237-244,  259,  260,  263,  264,  331-338, 
358-362,  419-422,  427,  428 
advantages  and  disadvantages  of,  7-9 
amount  of  inspection,  14,   15,  26-28,  237-244, 
259,  260n.,  263,  264,  427,  428 
comparison  of  double,  multiple  and  sequen- 
tial sampling,  237-244 
figures  showing,  240-243 
comparison  of  sequential  and  single  sampling, 

26,  419,  420 
comparison  with  variables  inspection,  26-28 
formula    for    determining,     with    sequential 
sampling,  427,  428 
with  single  sample,  259,  260n.,  263,  264 
nomogram     for     determining,     with     single 
sample,  260 
procedure  for  using,  260n. 
when   severity   of   any   one   test    cannot   be 
controlled  precisely,  331-338 
(See  also  Amount  of  inspection) 
to  avoid  normality  assumption,  8,  229n. 
as  basis  for  deciding  whether  to  abandon  or  to 

complete  an  experiment,  355-362 
contrasted  with  inspection  by  variables,  7-9, 
26-28 


Attributes,  curtailment  of,  14,  15,  238 
with  double  sampling,  237-244 

amount  of  inspection,  average,  240,  242,  244 

variability  of,  238,  241,  243,  244 
characterized  by  relation  of  Rk  to  Ak,  237 
computational  procedures  for,   references  to, 

244 
contrasted     with     multiple     and     sequential 
sampling,  237-244 
formulas,  for  acceptance  line,  with  sequential 
sampling,  422 
for    amount    of   inspection,    with    sequential 
sampling,  427,  428 
with  single  sampling,  259,  260n.,  263,  264 
for   ASN    curve,    with   sequential   sampling, 

427,  428 
for     five-point     OC     curve,     for     sequential 

sampling,  427 
for    OC    curve,    for   single    sampling,    260n., 

261-263 
for  rejection  line,  with  sequential  sampling, 

422 
for     rejection     number     (proportion),     with 
single  sample,  260n.,  263,  264 
when  inspection  is  costly,  9 
when  inspection  is  destructive,  9 
with  multiple  sampling,  237-244 

amount  of  inspection,  average,  240,  242,  244 

variability  of,  238,  241,  243,  244 
characterized  by  relation  of  Rk  to  Ak,  237 
computational  procedures  for,  references  to, 

244 
contrasted      with     double     and     sequential 
sampling,  237-244 
nomogram,  for  determining  amount  of  inspec- 
tion and  OC  curve  for  single  sampling,  260 
procedure  for  using,  260n.,  263 
OC  curve,  for  sequential  sampling,  procedure 
for  five-point,  427 
single  sampling,  formula  for,  260n.,  263,  264 
nomogram  for,  260n.,  263 
when    range   in   samples    of    k   determines   lot 

quality,  229n. 
reasons  for  use,  7-9 

selection  of  AQL  in,  discussion  of,  45-47 
with   sequential   sampling,    26,    235-244,    357- 
362,  419,  420,  421,  422,  427,  428 
acceptance  line,  formula  for,  422 
when   acceptance   line   is   fixed   in   advance, 
358-362 
OC  curve,  procedure  for,  361,  362 
when  acceptance  line  is  omitted,  357,  358 

OC  curve,  formula  for,  358 
amount  of  inspection,  average,  26,  240,  242, 
244,  419,  420,  427,  428 
variability  of,  238,  241,  243,  244 
as  basis  for  deciding  whether  to  abandon  or 

to  complete  an  experiment,  357-362 
characterized  by  relation  of  Rk  to  Ak,  237 
computational  procedures  for  curtailed  (trun- 
cated)    sampling,     references    to,     244, 
361n.,  362n. 
steps  of,  361,  362 
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Attributes,   with  sequential  sampling,  contrasted 
with  double-  and  multiple-sampling  attri- 
butes inspection,  237-244 
contrasted  with  sequential  variables  inspec- 
tion, 26n.,  85 
single-sampling,  attributes  inspection,  26 
curtailed  (truncated),  361,  362 
OC  curve,   when  acceptance  line  is  fixed  in 
advance,  procedure  for,  361,  362 
when  acceptance  line  is  omitted,  formula 

for,  358 
general  five-point,  formulas  for,  427 
when    sampling    is    curtailed    (truncated), 
steps  in  determining,  361,  362 
rejection  line,  formula  for,  422 
when  severity  of  any  one  test  cannot  be  con- 
trolled precisely,  331-338 
with   single   sample,    14,    15,    26-28,    237,    238, 
259,  260^.,  263,  264 
acceptance  criterion  for,  260n.,  263,  264 
amount  of  inspection  for,  14,  15,  26-28 

contrasted    with    sequential-sampling    at- 
tributes inspection,  26-28 
contrasted    with    single-sample    variables 

inspection,  26-28 
formula  for,  259,  260n.,  263,  264 
nomogram  for,  260 

procedure  for  using,  260tc. 
characterized  by  relation  of  Rk  to  Ah,  237 
contrasted      with      double,      and      multiple 
sampling,  237 
with  sequential  attributes  inspection,  26 
with    single-sample    variables    inspection, 
26-28 
curtailed,  14,  15,  238 
OC  curve,  formula  for,  260n.,  263,  264 

nomograph  for,  260?i.,  263 
rejection  criterion,  260n.,  263,  264 
when  tests  are  destructive,  9 
Authorship,  ix 
Average    (see   Arithmetic    mean;    Mean;    Sample 

mean) 
Average  error,  162 

(See  also  Mean  deviation) 
Average  sample  number  (ASN,  N),  238,  420 

(<See  also  ASN  curve;  Sample  size,  average) 
ASN  curve,  233-234,  238,  240,  242,  244,  362,  427- 
428 
comparison    of,    for    double-,    multiple-,     and 
sequential-sampling  attributes  plans,  237- 
244 
figures  showing,  240,  242 
computation  of,  references  to  general  methods 

of,  244,  362n. 
definition  of,  for  general  one-parameter  tests,  427 

special  cases,  233,  238 
for   double-sampling   attributes   plans,    figures 

showing,  240,  242 
figures  showing,  234,  240,  242 
five      point,      for      sequential      one-parameter 

tests,  428 
formulas,  for  general  one-parameter  sequential 
test,  427 


ASN    curve,    formulas,   for  sequential   one-sided 
binomial  test,  427,  428 
with  acceptance  line  omitted,  358 
for  sequential  one-sided  test  of  a,  233 
maximum  of,  position  of,  428 
for  multiple-sampling  attributes  plans,  figures 

showing,  240,  242 
for  sequential  attributes  plans,  figures  showing, 
240,  242 
formula  for,  427,  428 
for  sequential  one-sided  binomial  test,  figures 
showing,  240,  242 
formula  for,  427,  428 
for  sequential  one-sided  test  of  <r,  figure  showing, 
234 
formula  for,  233 
Average  sample  size  (see  ASN  curve;  Sample  size, 
average) 


B:  critical  value  of  likelihood  ratio,  in  sequential 

analysis,  420 
B:  function  appearing  in  formulas  for  sequential 

variables  inspection,  84 
b:   critical   value   of   minus   X(  —  X)   in   sequential 

analysis,  84,  421 
b:   function   of   N,   Ka,   and   K\   that  appears  in 
formula  for  k  when  N  is  fixed,  19,  59,  61 
formulas  for,  20,  59,  61 
b:    multiple    of    o/\/2(N  —  1)    in    formula    for 
standard  deviation  of  s,  58 
formula  for,  58 
reference  to  table  of,  58n. 
b:  ratio  of  standard  deviation  of  s  to 

<t/\/2(N  —  1),  in  normal  samples,  58 
Baines,  A.  H.  J.,  281m.,  297,  313,  314,  318 
Barlow's  tables,  34,  93 
Bartlett,  M.  S.,  183,  385,  386,  387,  393,  398,  399, 

400,  401,  403-407,  411,  413-415 
Bartlett's  adjustment,  of  inverse-sine  transforma- 
tion, 399 
heuristic  derivation  of,  413,  414 
table  to  facilitate,  406 
Bartlett's  test  of  homogeneity  of  variance  esti- 
mates, 183,  387,  388 
compared  to  F-test  for  homogeneity  of  sample 

means,  386 
as  "omnibus  test,"  385,  386 
optimum  properties  of,  386 
separation   into   parts,   assignable   to   different 

factors,  386 
tables  to  facilitate  application  of,  reference  to, 
388n. 
Baten,  W.  D.,  193n.,  218n.,  222 
Bennett,  Rollin  F.,  ix 

|8:  probability  of  erroneously  accepting  hypothesis 
under  test,  271,  419 
taken  equal  to  0.50,  20,  21,  271,  274,  282,  283, 

313,  314 
use  in  determining  sequential  test  procedures, 
420,  421 
(See  also  Error  of  the  second  kind) 
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/3:   probability  that  lot   of   unacceptable   quality 
will  be  accepted,  16 

(See  also  Consumer's  risk) 
/3:  true  slope  of  a  linear  regression  line,  322 

estimation  of,  when  cost  of  each  measurement 
depends  upon  its  magnitude,  322—328 
j8i:  Pearson's  measure  of  skewness,  197,  198 
/3a:  Pearson's  measure  of  kurtosis,  197,  198 
Beta  function,  incomplete-,  392,  393 

percentage  points  of,  reference  to  table  of,  338, 
394 
use  to  determine  confidence  limits  for  popula- 
tion proportion,  333,  334 
Bias,  of  bombing  errors,  137,  147,  149 

of  mean  deviation  as  estimator  of  a,  169n. 

of  p  as  estimator  of  P,  397,  410 

of  <p(p)  and  <pb(p)  as  estimators  of  <p(P),  403,  404 

tables  showing,  400-402 
of  s  as  estimator  of  a,  58j,  162 

adjustment  factor,  formula  for,  58 
tables  of,  references  to,  58n. 
of  s2  as  estimator  of  cr2,  192,  199,  412 

when  data  are  grouped  or  rounded,  199 
of    sample    mean    as    estimator    of    population 
mean,  58,  162,  197,  198 
when  data  are  grouped  or  rounded,  197,  198, 
212,  213 
of    sample    median    as    estimator    of    normal 

population  mean,  162 
of  sample  proportion  as  estimator  of  population 
proportion,  397,  410 
when    inverse-sine    transformation    is    used, 
403,  404 
tables  showing,  400-402 
of  statistical  tests,  if  Sheppard's  corrections  are 

used,  199,  200,  203 
from  time  trends  in  data,  371,  372 
from  use   of  inverse-sine  transformation,   403, 
404 
tables  showing,  400-402 
from  use  of  Sheppard's  corrections,  199,  200,  203 
of  x  as  estimator  of  y.,  58,  162,  197,  198 

when  data  are  grouped  or  rounded,  197,  198, 
212,  213 
of  x  +  ks  as  estimator  of  /x  +  ka,  58 
Bigelow,  Julian  H.,  ix 
Binomial  distribution,  250,  251,  421,  427 

ASN    curve   for   sequential   one-sided   test   of 

parameter  of,  figures  showing,  240,  242 
formula  for,  427,  428 

confidence  limits  for  parameter  of,  332-336 
belts  giving  eighty  percent  limits,  335 

ninety  percent  limits,  334 
'     ninety-five  percent  limits,  333 
ninety-nine  percent  limits,  332 
accuracy  of,  335n. 
preparation  of,  335n.,  336n. 
sources  of,  335n. 
determination  of,  from  percentage  points  of 
incomplete  beta  function,  333,  334 
examples  of,  334,  337 
by  using  inverse-sine  transformation,  336 
by    using    non-central    ^-distribution,    76, 
77,  336n. 


Binomial     distribution,      confidence     limits     for 
parameter  of,  formula  for,  for  case  of  zero 
occurrences,  335    336 
relation  of,  to  percentage  points    of    incom- 
plete beta  function,  333,  334 
design   of   experiments,   for   comparing   param- 
eters of  two,  258-263,  424,  427,  428 
for   estimating   parameter   of   a   single,    238, 

333-337 
for  testing  parameter  of  a  single,  259,  260n., 
261-264 
double  dichotomies,  analysis  of,  by  means  of, 

249-253,  255 
examples  of,  251 

experiments  for  comparing  parameters  of  two, 

analysis  of  data  from,  249-258,  424,  425 

approximate  single-sample  method,  based 

on  binomial  test,  249,  250,  255 

based  on  Xc,  253-255,  256,  257 

based    on    inverse-sine    transformation, 

257,  258,  263,  264 
based  on  normal  deviates,  253-257,  263 
"exact"  single-sample  method  for,  250-253 
by  Fulcher-Zubin  Item  Analyzer,  257,  258 
incorrect  method  for,  256 
by  sequential  analysis,  424,  425 
by  Zubin-Fulcher  Item  Analyzer,  257,  258 
design  of,  258-265,  424 
inverse-sine    transformation    of    data    from, 

257,  258,  263,  264,  397-408 
number  of  observations  needed,  with  sequen- 
tial sampling,  424,  427,  428 
with  single  samples,  258-261,  264 
formula  for,  258,  259,  264 
nomogram  for,  260 
operating  characteristics  of  (OC),  261 
OC  curve  of,  determination  by  formula,  262 
from  nomogram,  263 
figure  showing,  262 
OC  surface  of,  261,  262 
planning  of,  258-265,  424 
sequential,  424,  425 
experiments    for    estimating    parameter    of    a 
single,  analysis  of  data  from,  332—336 
design  of,  238,  333-337 
inverse-sine  transformation  applied  to  data 

from,  336 
number  of  observations  needed,  333-337 
sequential,  238n. 
experiments  for  testing  parameters  of  a  single, 
analysis  of  data  from,  260n.,  263,  264,  421, 
422 
ASN     curves     for     double,     multiple,      and 

sequential  sampling,  240,  242 
design  of,  259,  260n.,  261-264,  427,  428 
inverse-sine    transformation    of    data    from, 

260n.,  261^264,  397-408 
number  of  observations  needed,  with  double 
sampling,  237-244 

figures  showing,  240-243 
with  multiple  sampling,  237-244 

figures  showing,  240-243 
with    sequential    sampling,    26,    237-244, 
419,  420,  427,  428 
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Binomial    distribution,    experiments    for    testing 
parameters    of    a   single,    number    of 
observations  needed,  with  sequential 
sampling,  figures  showing,  240-243 
formula  for,  427,  428 
with  single  sampling,   14,   15,  26-28,  259, 
260m.,  263,  264 
examples  of  (Table  1.3),  26 
formula  for,  259,  260n.,  263,  264 
nomogram  for,  use  of,  260«. 
OC    curve    of,    approximate    determination 
from  formula,  260n.,  262 
from  nomogram,  260n.,  263 
exact   determination    of   points    on,    using 
confidence-belt  charts,  336». 
using   percentage   points   of   incomplete 
beta  function,  334,  336n. 
planning  of,  259,  260rc.,  261-264,  427,  428 
sequential,  421,  422,  427,  428 
one-decision  test,  358 
formula  for,  examples,  251 

homogeneity    of    samples    from    two,     testing, 
249-258,  424,  425 
(See  also  Binomial  distribution,  experiments 
for  comparing  parameters  of  two) 
inverse-sine     transformation     of     data     from, 

257,  258,  259-264,  336,  397-408 
mean  of,  397,  410 

moments  of,  third  and  fourth  about  mean,  410 
nomogram,    for    determining    sample    sizes,    in 

testing  parameters  of,  260 
normal  approximation  to,  263,  264 
number  of  observations  needed,  for  comparing 
parameters   of  two,   258-261,   264,   424, 
427,  428 
formula  for,  258,  259,  264 
nomogram  for,  260 
for    estimating    the    parameter    of    a    single, 

333-337 
for  testing  the  parameter  of  a  single,   with 
double  sampling,  237-244 
figures  showing,  240-243 
with  multiple  sampling,  237-244 

figures  showing,  240—243 
with    sequential    sampling,    26,    237-244, 
419,  420,  427,  428 
figures  showing,  240-243 
formula  for,  427,  428 
with  single  sampling,   14,   15,  26-28,  259, 
260n.,  263,  264 
examples  of,  26 

formula  for,  259,  260n.,  263,  264 
nomogram  for,  use  of,  260n. 
OC  curves,  of  tests  of  parameter  of,  262 
one-decision  test  of  parameter  of,  358 
planning  experiments  for  comparing  parameters 
of  two,  258-263,  424,  427,  428 
for  estimating  parameter  of  a    single,    238, 

333-337 
for  testing  parameter  of  a  single,  259,  260n., 
261-264 
Poisson  exponential  approximation  to,  426,  428 
sequential  estimation  of  parameter  of,  reference 
to  work  on,  238n. 


Binomial   distribution,  sequential  test,   for  com- 
paring two,  424,  425 
for  testing  parameter  of  a  single,  421,  422, 
427,  428 
one-decision  test,  358 
single  sampling  for  comparison  of  two,  249-264 
for  estimation  of  parameter  of,  332—336 
for  testing  parameter  of,  14,  15,  26-28,  259, 
260n.,  263,  264 
standard  deviation  of,  255,  256,  397 
variance  of,  397,  410 
Binomial-probability  paper,  414n. 
Biological  assay,  341,  342,  352 
(See  also  Sensitivity  tests) 
Biological  experimentation,  311 
Bishop,  D.  J.,  386n.,  393 
Bivariate  distribution,  normal,  146 

relation    of    Hotelling's    generalized    T   to    ex- 
ponent of,  114,  115 
Bliss,   Chester  I.,  342n.,   343n.,   350n.,  352,   398, 

400,  405,  415 
Bombsights,  air  testing  of,  function  of,  117 

multivariate   quality   control   applied   to,   111- 

184 
problems  of,  116 
Bowker,  Albert  H.,  ix,  xi,  96,  101n.,  109 
Breaking  strength,  365 

(See  also  Metallurgical  experiments) 
Breen,  Nancy  Brixey,  x 
Broad  classes  (see  Grouping) 
Brown,  Richard  H.,  343n.,  352 
Brown  University,  ix 

Bureau  of  the  Census,  ix,  xi,  xii,  226,  354 
Bureau  of  Ordnance,  Navy,  352 
Bureau  of   Standards   (see  Mathematical  Tables 
Project;  National  Bureau  of  Standards) 


C2'.  ratio  of  expected  value  of  root  mean  square  of 
deviations  from  sample  mean,  to  a,  in  normal 
samples,  58 
reference  to  tables  of,  58n. 
relation  to  a,  58n. 
Calculating  machines,  comparison  of  makes   and 
models,  34n. 
use  of,  to  simplify  computation  of  s,  33,  34 
Camp-Meidell  inequality,  49n.,  221 
Carver,  H.  C,  194n.,  222 
Causes,  assignable,  269 

that  introduce  components  of  random  varia- 
tion, 297,  310,  311 
that    introduce    components    of    systematic 
variation,  310,  311 
chance,  297,  310,  311 
Cayley,  Arthur,  191,  222 
Central  tendency,  measures  of,  9 
Chance  causes,  297,  310,  311 

Characteristics,     operating,    of    sampling-inspec- 
tion plans  and  statistical  tests   (see  Operat- 
ing  characteristics;    Operating-characteristic 
curves;  Operating-characteristic  surface) 
Charts,  control  (see  Control  charts) 
list  of,  xiv 
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Check  list,  of  possible  causes  of  lack  of  control,  in 
air  testing  of  bombsights,  140 
of  steps  for  calculating  5(/,  to,  e),  74,  75 
for  calculating  t(f,  5,  e) ,  75,  76 
for  computing  OC  curve,  for  any  curtailed 
(truncated)   sequential  inspection  (test) 
procedure,  361 
for    single-sampling    variables    inspection 
plan  for  fraction  defective,  32,  33 
for  determining  k,  given  N,  pi,  and  1  —  a,  76 

given  N,  pi,  and  /3,  76 
for   determining   N  and   k,   given   pi,   pi,   a, 

and  /3,  81,  82 
for  determining  p,  given  N,  wo,  and  e,  76,  77 
for  determing  uo,  given  N,  pi,  and  e,  76 

(See  also  Computation) 
of    general    likelihood-ratio    sequential    test, 

420,  421 
for    selection    of    bombsights    for    test,    and 

arrangement  of  bombings,  119-122 
of  sequential  procedures  for  locating  a  maxi- 
mum, 367,  368 
Xc:  chi(x)  corrected  for  continuity,  254 

calculation  of,  from  observed  proportions  (per- 
centages), 256,  257 
from  2X2  table,  254,  255 
significance  levels  of,  255 
X2  (chi-squared) :  a  variable  having  the  x2-distri- 
bution,  73,  91,  270 
addition  theorem  for,  153 
adjusted  value  of,  254 

calculation  of,  from  proportions  (percentages), 
256,  257 
from  2X2  table,  254,  255 
correction  for  continuity,  254 
significance  levels  of,  255 
small  expected  numbers,  253 
correction  for  continuity,  general  procedure  for, 

254n. 
distribution  of,  formula  for,  270 

percentage  points  of,  108-110,  315n. 

Cornish-Fisher   type   expansion  for,  refer- 
ence to,  109 
relation  of  Ka  to,  258n. 
relation  to  partial  sum  of  Poisson  series,  109 
tables  of,  315n. 
relation  of  F-distribution  to,  316 
of  normal  distribution  to,  258rc. 
of  s2-distribution  to,  270,  315 
of  T-distribution  to,  155 
formulas  for,  special,  254,  256,  257 
significance  levels  of,  108-110,  315n. 

Cornish-Fisher  type  expansion  for,  reference 
*'    to,  109 

relation  to  partial  sum  of  Poisson  series,  109 
tables  of,  315n. 
test  of  homogeneity,   of  proportions   (percent- 
ages), 256,  257 
of  2  X  2  table,  254,  255 
of  variance  estimates,  387,  388 
(See  also  Bartlett's  test) 
use  in  planning  and  interpreting  experiments 
for   testing   standard    deviation    (or    vari- 
ance) estimates,  269-280 


Class  interval  (see  Grouping  interval) 

Clopper,  C.  J.,  335n.,  338 

Cochran,   William  C,  254n.,  265,  387-389,  392, 

393,  399,  407,  408,  415 
Cochran's  test,  for  significance  of  largest  variance 
estimate,  383-394 
contrasted  with  Bartlett's  test  for  homogeneity 

of  variance  estimates,  386,  387 
example,  worked,  388,  389 
significance  levels  of,  tables  of,  390,  391 
accuracy  of,  388,  393 

computation  of,  method  employed,  389-393 
interpolation  in,  method  of,  388 
Coefficient,  correlation,  413 

regression,    estimation   of,    when   cost   of   each 
measurement  depends  upon  its  magnitude, 
322-328 
of  line  fitted  to  probit  data,  343 
estimation  of,  343,  344 
relation  to  n  and  a,  343 
of  s,  for  setting  tolerance  limits  (see  K) 

in  variables  inspection  for  fraction  defective 
(see  k) 
Columbia  University,  iii,  ix-xi 

Committee  on  War  Research  of,  x 
Division  of  War  Research  of,  vii 
Statistical  Research  Group  of,  iii,  vii-x,  6,  8,  20, 
26,  34,  42,  52,  83,  85,  93,  101,  108,  109,  143, 
184,  188,  231,  234,  237,  244,  257,  260,  335, 
358,  361,  362,  400,  405,  419,  420,  428 
Trustees  of,  vii 
Combination  of  observations,  42,  43,  113,  114,  167 
Combinations    of    grouping-interval    widths    and 
sample  sizes  satisfactory  for  statistical  tests, 
213,  214 
Combining  estimates  of  n,  42,  43 
of  a,  42,  43 
probabilities  from  statistical  tests,  167,  184 
tests  for,  50-52,  113-116 

effect  of,  on  probability  of  acceptance,  50,  51 
into  a  single  "score,"  51,  113,  114 

based  on  generalized  Student's  T,  114-116 
values  of  s  or  s2,  42,  43 
values  of  x,  42,  43 
Comparison,      of     two     binomial     distributions, 
design  of  and  analysis  of  data  from  experi- 
ments for,  247-265,  424,  425,  427,  428 
of  two  means,   design   of  and  analysis  of  data 

from  experiments  for,  61,  62 
of  two  percentages  (or  proportions),  design  of 
and  analysis  of  data  from  experiments  for, 
247-265,  424,  425,  427,  428 
of    two    standard    deviations     (or    variances), 
design  of  and  analysis  of  data  from  experi- 
ments for,  267-318 
Components  of  variance,  269,  297,  310-314 
Compounding  probabilities  from  statistical  tests, 

167 
Computation,  of  acceptance  line,   232,  233,  356, 
357,  423,  425 
acceptance  number,  260n.,  263,  264,  422,  425 
acceptance  region,  boundaries  of,  36-41,  52-56. 

57,  67-71,  232,  233,  422,  423,  425 
adjusted  value  of  chi  or  chi-squared,  254,  255 


INDEX 


441 


Computation,  adjustment  constant,  optimum,  41 , 
54,  70,  71 
analysis-of- variance  table,  312 
angular     difference     between      two      binomial 
probabilities,  264 
between  two  observed  proportions    (or   per- 
centages), 262 
arithmetic  mean,  12,  100,  192 
ASN  curves  for,  233,  244,  362n.,  427,  428 
Bartlett's    adjustment,    of    inverse-sine    trans- 
formation, 399 
Bartlett's  test,  of  homogeneity  of  variance  esti- 
mates, 387,  388 
bias  of  s  as  estimator  of  <r,  58 

of    s2    as    estimator    of   a2,    when     data     are 

grouped,  199 
of  x  +  ks  as  estimator  of  n  +  ka,  58 
binomial  distribution,  251 

Xc,  from  observed  proportions  or  percentages, 
256,  257 
from  2X2  table,  254,  255 
components  of  variance,  310-314 
confidence  limits,  for  p,  when  a  is  known,  197,  198 
when  a  is  unknown,  99,  200-203,  213-215 
for     parameter     of     binomial     distribution, 

333-336 
for  population  mean,  99,  197,  198,  200-203, 

213-215 
for  population  proportion,  42,  43,  333-336 
for  <r  or  a2,  198-200,  200-203,  213-215 
control  limits,  for  means,  147-149 
for  T-statistics,  152,  153 
for  variances,  151 
«(/,  to,  e),  74,  75 

dosage-mortality  curve,  342-344,  350,  352 
efficiency,  of  estimators  of  population  param- 
eters, 160-163,  227n. 
of    inspection    plans    and    statistical    tests, 

26-28,  85 
(See  also  ASN  curve) 
expected  frequencies,  in  2  X  2  table,  253,  254 
hi  and  hi,  232,  233,  356,  357,  422,  423,  425 
K,  approximate,  101,  108 

exact,  108,  109 
k,  approximate,  17-19 
exact,  76-79,  81,  82 
LP,  28-33,  50-52,  76,  77,  80,  358,  361,  427 
Ltf,,  282-296,  316,  317 
Lp,  231,  271-279,  315 
La,  232,  233 
Lt,  262,  263 
N,  approximate,  17 

exact,  81,  82 
Na,  259-260,  260n.,  263,  264 
Nm,  28n.,  62 

Nb,  27n.,  28n.,  231,  274,  275,  278 
Nt,  17-21,  26-28,  60-66,  81,  82 
number  of  observations  needed,  for  analysis-of- 
variance  tests,  311-314 
for  attributes  plans,   with  sequential  samp- 
ling, 427,  428 
with  single  sampling,  259,  260n.,  263,  264 
when  severity  of  any  one  test  cannot  be 
controlled  precisely,  331-338 


Computation,    number   of   observations   needed, 
for  binomial  test,  one-sided,  with  sequen- 
tial sampling,  427,  428 
with  single  sampling,  259,  260n.,  263,  264 
for  x2  tests  of  variance  estimates,  231,  274, 

275,  278 
for  curtailed  (truncated)  sequential  sampling, 

362n. 
when  decisions  are  based  on  signs  of  differ- 
ences between  sample  means,  380 
for  determining  whether  new  process  (prod- 
uct) is  more  (less)  variable  than  stand- 
ard, 231,  274,  275,  278 
whether    one    process    (product)    is    more 
variable  than  another,  280,  282,  283,  296 
for  difference  of  means  test,  when  a  is  known, 
with  single  sample,  61,  62 
when  <x  is  unknown,  17,  27n.,  28n.,  60-66, 
81,  82 
for     difference     of     proportions     test,     with 
sequential  sampling,  424,  427,  428 
single  sample,  259,  260,  263,  264 
for    double-dichotomy    test,    259,    260,    263, 

264,  424,  427,  428 
for  F-tests  of  variance  estimates,   280,   282, 

283,  311-314 
for  fraction-defective  test  based  on  variables, 
single  sample,  17 
when  fx  is  known,  and  a  unknown,   27n., 

28rc.,  231,  274,  275,  278rc. 
when  [i  is  unknown,   and  a  known,   27«., 

28n.,  62 
when  /x  and  <r  are  both  unknown,   17-21, 
27n.,  28n.,  60-66,  81,  82 
for  general  one-parameter  tests,  sequential, 
427 
single  sample,  61,  62 
for  mean  test,  single  sample,  a  known,  27n., 
28n.,  62 
a  unknown,  17-21,  27n.,  28n.,  60-66,  81,  82 
for    one-decision    sequential    binomial    test, 

358,  427,  428 
for  proportion  test,  sequential  sampling,  427, 
428 
single  sample,  259,  260n.,  263,  264 
when  range  in  samples  of  k  determines  lot 
quality,  sequential  sampling,  233 
single  sample,  231,  274,  275,  278m. 
for    setting    tolerance    limits,    references    to, 

109,  110 
for  standard  deviation  tests,  231,  274,   275, 

278,  280,  282,  283,  296 
for    variables    inspection    plans    for   fraction 
defective,  17-21,  27n.,  28n.,  64-66,  81,  82 
OC   curve,  for  analysis-of-variance  tests,  310- 
314 
for    attributes    plans,    sequential    sampling, 
426,  427 
single  sampling,  260n.,  262,  263 
for  binomial  test,  one-decision  sequential,  358 
one-sided  sequential,  426,  427 

truncated,  361 
one-sided  single  sample,  260n.,  262,  263 
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Computation,  OC  curve,  for  x2-tests  of  variance 
estimates,  231,  232,  271-279,  315 
for  comparison  of  two  means,  a  known,  61,  62 

a  unknown,  30,  59-63,  71-73,  76,  77,  80 
for   comparison   of  two   proportions   or   per- 
centages, sequential,  424,  426   427 
single  sample,  262,  263 
for   comparison   of  two  standard   deviations 
or    variances,    single    sampling,    282-296, 
316,  317 
when  decisions  are  based  on  signs  of  differ- 
ences between  sample  means,  378,  379 
for  double-dichotomy  test,  262,  263 
for  F-tests   of   variance   estimates,   282-296, 

316,  317 
for  fraction-defective  test,  based  on  variables, 

28-33,  76,  77,  80 
for  general  one-parameter  tests,    sequential, 
426,  427 
single  sample,  61,  62 
for  mean  test,  a  known,  61,  62 
for  one-decision  sequential  binomial  test,  358 
for    proportion    test,    sequential    sampling, 
426,  427 
single  sample,  260n.,  2G2-263 
when  range  in  samples  of  h  determines  lot 
quality,  sequential,  233 
single  sample,  231,  271-279,  315 
for  standard  deviation  tests,  231-233,  271- 

279,  282-296,  315-C17 
for   variables   inspection    plans   for   fraction 
defective,  28-33,  76,  77,  80 
<p  and  <pb,  399,  401 
</>(a,(3,ni,n2),  282-29G,  310,  317 
RP,  for  one-decision  sequential  binomial  test, 

358 
p(a,/8,?i),  231,  271-279,  315 
sors^  12,33,34,  162,  192 

significance  levels,  of  Cochran's  test,  389-393 
of  T-distribution,  155 
of  T-statistics,  approximate,  155 
of  i-distribution,  approximate,  148 
T  (angular  difference),  262 
T  (Hotelling's),  142,  154 
^-statistics,  131,  133,  135,  145 
t(f,8,e),  75,  76 
tolerance  limits,  97-109 
two-point    estimate    of    n    and    a    from    probit 

data,  350 
m,  76 

x,  12,  100,  192 

yardsticks,  for  differences  of  means,  150 
for  ratio  of  two  variance  estimates,  152 
for  T-statistics,  156 
Computer's  rule  (see  Rounding  rule) 
Comrie,  L.  J.,  93 
Conditional  probability,  41,  251 
Confidence  in  conclusions,  98,  269 
Confidence  belts,  accuracy  of,  335n. 
for  population  proportion,  332-336 
eighty  percent,  335 
ninety  percent,  334 
ninety-five  percent,  333 
ninety-nine  percent,  332 


Confidence  belts,  preparation  of,  335n.,  336n. 

sources  of,  335n. 
Confidence  coefficient  (y,T),  43,  98,  100,  108,  332, 
336 
definition  of,  98 
interpretation  of,  43 
Confidence  intervals,  42,  43,  99,  100,  332-336 
contrasted    with    statistical    tolerance    ranges, 

99,  100 
definition  of,  42 

dependence  of  width  of,  on  confidence  coeffi- 
cient, 43 
on  sample  size,  99,  100 
for  parameter  of  binomial  distribution,  332-336 
for  population  proportion,  based  on  attributes 
inspection,  332-336 
based  on  variables  inspection,  42—45 
for  process-average  percent  defective,  based  on 
attributes  inspection,  332-336 
based  on  variables  inspection,  42,  43 

(See    also    Confidence    belts;     Confidence 
limits) 
Confidence  limits,  99,  100,  196-203,  332-336 
contrasted  with  statistical  tolerance  limits,  99, 

100 
effect  of  grouping  (rounding)  on,  192,  196-203, 

213-215 
for  ix,  when  a  is  known,  197,  198 

based  on  grouped  (rounded)  data,  197,  198 
when  a  is  unknown,  99 

based  on  grouped  (rounded)  data,  200-203, 
213-215 
for  parameter  of  binomial  distribution,  332-336 
belts  giving,  332-335  (see  Confidence  belts) 
determination  of,  accurate,  from  percentage 
points    of   incomplete   beta   function. 
333,  334 
examples  of,  334,  337 
approximate,    by    means    of    inverse-sine 
transformation,  336 
by  means  of  non-central  ^-distribution, 
76,  77,  336n. 
formula  for,  accurate,  for  case  of  zero  occur- 
rences, 335 
approximate,  336 
relation  of,   to  percentage  points  of  incom- 
plete beta  function,  333,  334 
for  population  mean,  99,  197,  198 
for  population  proportion,  332-336 
for  population  standard  deviation  (or  variance), 
based  on  rounded  data,  198-200,  213-215 
(See  also  Confidence  belts;  Confidence  inter- 
vals) 
Consumer's  risk  (/3),  conventional  value  of  (10%), 
46 
definition  of,  16,  228,  229 

effect  of  reduced  or  tightened  inspection  on,  45 
maximum  of,   with   variables  inspection  plans 

for  fraction  defective,  16,  17 
selection  of,  factors  to  consider  in,  45-49 
Consumption  requirements,  analysis  of,  47-49 
Contingency    tables,    2X2,    249-265,    424,    425 
(see   Binomial   distribution,   experiments   for 
comparing  parameters  of  two) 
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Contracts,  vii 

Control  charts,   applied  to  air  testing  of  bomb- 
sights,  122-142 
some  novel  features  of,  119-147 
trouble  shooting  with  aid  of,  137-140 
Control  limits,  for  means,  147-150 

allowance  for  sampling  variation  of  data  in 
base  period,  119,  147,  148 
for  T"-statistics,  152-155 
for  variances,  151,  152 
(See  also  Quality  control) 
Cornish-Fisher  expansion,  109 
Correction,  Bartlett's,  399 

heuristic  derivation  of,  413,  414 
table  to  facilitate,  406 
of  chi  or  chi-squared,  for  continuity,  254,  255, 
257 
general  procedure  for,  254n. 
factors:  a,  b,  and  a,  58 
for  grouping,  Sheppard's,  193-196,  222-223 

not  to  be  used  in  connection  with  confi- 
dence limits  or  tests  of  significance,  199, 
200,  203 
of   inverse-sine   transformation,    399,   405-407, 

413,  414 
Yates',  254,  255 

(See  also  Adjustment) 
Correlation  coefficient,  mean  of,  approximate,  413 
transformation  of,  to  stabilize  variance,  413 
variance  of,  approximate,  413 
2-transformation  of,  413 
Cost,  of  experiment,  269 

of  measurements  dependent  upon  their  magni- 
tudes, 321,  322 
linear  regression  analysis  when,  322-328 
Craig,  Cecil  C,  93,  194n.,  195n.,  196n.,  222 
Critical  level  of  a  normally  distributed  test  func- 
tion, derivation  of,  61,  62 
Crum,  W.  L.,  145n.,  183 

Cumulative  distribution,  estimating  fj.  and  a  from 
data  on,  339-352 
optimum  design  of  experiments  for,  345,  346, 
349-352 
Curtailment,  of  sequential  test,  361 

of   single-sample   inspection   plan   for   fraction 
defective,  based  on  attributes,  14,  238 
based  on  variables,  14 
Curtiss,  John  H.,  407,  410,  415 
Curve  fitting,   when  cost  of  each    measurement 
depends  upon  its  magnitude,  321-328 
when    data    are    observations    on    cumulative 

distribution,  342-352 
when  data  are  probits,  342-352 
to  data  from  sensitivity  tests,  342-352 


d:  distance  from  target,  342 
Daniels,  H.  E.,  297n.,  318 

Data,  adjusted,  in  variables  inspection  for  fraction 
defective,  36-41,  54,  69-71 
amount  of  (see  Amount  of  inspection;   N;   N; 
Sample  size) 


Data,  on  attributes,  7 
bias  of,  160,  371 

combination  of,  42,  43,  113-116,  167 
correlated,  50-52,  113-116 
on  cumulative  distribution,  339-352 
grouped, 187-215 
probit,  339-352 
representative,  160 
rounded,  187-223 
selection  of,  reasons  for  random  step  in,  158- 

160 
tabular,  215-221 
time  trends  in,  371 

transformation  of,  to  stabilize  variance,  408- 
413 
on  variables,  7 
Davis,  O.  L.,  389?;.,  393 
Decimals,  significant,  189n.,  190n. 
Decision  function,  381,  382 

Decisions,    based    on    inspection    by    attributes, 
7,  237 
by  variables,  7 
based    on   sign    of   difference   between    sample 
means,  377 
conditions  rendering  optimal,  381,  382 
probability  of  being  wrong,  378-380 
figure  for,  379 
table  for,  378 
regarding  largest  of  two  or  more  variance  esti- 
mates, 386,  387,  388,  389 
regarding  level    of   performance   of   process   or 

product,  61,  62,  377,  423 
regarding   variability   of   processes    (products), 

231,  233,  269-271,  278,  279,  280-283 
wrong,  271,  281,  282,  313 
Deckinger,  R.  L.,  335n. 
Defective,  7 

Degrees  of  freedom  (f,n,v),  in  analysis  of  variance, 
312 
of  x2,  153,  231,  232,  270 
division  by,  12,  13,  43,  312 
of  F,  151,  281,  296,  312 
of  numerator  and  denominator  of  F  to  be  chosen 

equal,  296 
partitioning  of,  312,  386 

of  s  or  S2,  12,  13,  43,  199,  203,  270,  312,  387,  388 
of  T,  155 

of  T-statistics,  145,  152-154 
of  t,  148,  203 
non-central,  73 
5:  a  constant  in  numerator  of  non-central  t,  73 
Deming,  W.  Edwards,  110 
Department,  of  Agriculture,  352 

of  Commerce  (see  Bureau  of  the  Census;  Na- 
tional Bureau  of  Standards) 
Navy  (see  Bureau  of  Ordnance,  Navy) 
War  (see  Army) 
Departures  from  normality,  seriousness  of,  49,  50, 

99,  346,  377,  381 
Derivation,  of  acceptance  criteria,  59-63,  66-71, 
263,  264,  270,  280 
of  angular  transformation,  410,  411 
of  Bartlett's  adjustment,  413,  414 
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Derivation,   of  criteria  for  "immediate"   accept- 
ance,   in    variables    inspection    for    fraction 
defective,  66-71 
of  "exact"  test  for  independence  in  2  X  2  table, 

250-253 
of  formulas  for,  A  (adjustment  constant),  70,  71 
A  (critical  level),  61,  62 
adjusted  value  of  chi  or  chi-squared,  254n. 
average  number  of  observations,  references 

to,  244 
Bartlett's  adjustment,  of  inverse-sine  trans- 
formation, 413,  414 
Xe,  254n. 

critical  level,  61,  62 
K,  references  to,  101,  108 
k,  59-63 

LP,  30,  59,  61,  62 
Lt,  261,  262 
number  of  observations  needed,  60-63,  262- 

264,  380 
OC  curves,  30,  59,  61,  62,  261,  262 
<PB,  413,  414 

<t>  (a,/3,ni,n2),  281n.,  316,  317 
probability  that  sample  means  are  in  opposite 

order  to  population  means,  378,  379 
p  (a,/S,n),  315 

tolerance-limit  factor,  references  to,  101,  108 
two-point  estimates  of  /x  and  a  from  probit 
data,  350 
variances  of,  351,  352 
variability  of  amount  of  inspection,  references 
to,  244 
of  general  formulas  for  critical  level  and  sample 

size,  61,  62 
of  general  tranformation  to  stabilize  variance, 

408-410 
of  inverse-sine  transformation,  410,  411 
of  logarithmic  transformation  of  s  or  s2,  412,  413 
of  optimum  arrangement  for  estimating  n  and  a 

from  probit  data,  349-352 
of  optimum  choice  of  adjustment  constant,  70, 

71 
of  rejection  criteria,  59-63,  66-71,  263,  264,  270, 

280 
of  square-root  transformation,  of  a  Poisson  vari- 
able, 411 
of  standard  procedure  for  testing  whether  new 
process    (product)    is   more   variable   than 
standard,  270,  271 
whether  one  process  (product)  is  more  vari- 
able than  another,  280,  281 
of  tolerance-limit  factors,  references  to,  101,  108 
of  transformation  to  stabilize  variance,  408-410 
of  variances  of  two-point  estimates  of  \x  and  <r 

from  probit  data,  350-352 
of   2-transformation    of    correlation    coefficient, 
413 
Design  of  experiments,  245-372 

abandoning  prior  to  completion,  with  provision 

for,  353-362 
for  air-testing  bombsights,  116-122,  140-147 
analysis  of  variance,  leading  to,  297,  310-314 
for  comparing  two  means,  61,  62,  380,  381 


Design  of  experiments,  two  proportions  or  per- 
centages, 258-259,  424,  425 
two  standard  deviations  or  variances,  267- 
318 
when   cost   of   any   one   measurement   depends 

upon  its  magnitude,  319-328 
when  data  are  observations  on  cumulative  dis- 
tribution, 339-352 
when  data  are  probits,  339-352 
when  decisions  are  based  on  signs  of  differences 

between  sample  means,  377-382 
for  determining  existence  of  components  of  ran- 
dom variation,  297,  310-314 
for  determining  maximum  of  a  function,  363- 

372 
for  determining  whether  level  of  performance 
of  new  process  (product)  exceeds  standard, 
17,  27n.,  28n.,  60-66,  81,  82,  423 
whether  level  of  performance  of  one  process 

(product)  exceeds  that  of  another,  61,  62 
whether  variability  of  new  process  (product) 
exceeds  standard,  269-278 
is  less  than  standard,  278-280 
where   variability   of   one   process    (product) 
exceeds  variability  of  another,  280-296 
for  dosage-mortality  studies,  339-352 
for   estimating   components    of   variance,    297, 

310-314 
factorial,  356,  366,  367,  369-372 
failure  of  equipment,  with  provision  for  rate  of, 

355 
when  independent  variable  is  known  only  in 
terms  of  a  probability  distribution,  331-338 
for  maximum  of  a  function,  363-372 
for  probit  data  studies,  339-352 
for  proofing  gun  barrels,  331-338 
randomization  in,  141,  371 
rate  of  failure  of  equipment,  provision  for,  in, 

355 
replication  in,  141,  159 
for  sensitivity  tests,  339-352 
sequential,  417-429 

for  determining  maximum  of  a  function,  367- 

372 
when  rate  of  failure  of  equipment  is  high, 

353-362 
when  severity  of  any  one  test  cannot  be  con- 
trolled precisely,  331-338 
Designs  (see  Experiment  designs) 
Destructive  tests,  9,  321,  331,  341 
Deviate  (see  Normal  deviate) 

Deviation  (see  Mean  deviation;  Standard  devia- 
tion) 
Differences  between  means,  acceptance  criterion 
for  non-significance  of,  a  known,  61,  62 
a  unknown,  71,  72,  150 
analysis-of-variance  test  of  significance  of,  385, 

386 
design  of  experiments  when  decisions  are  to  be 

based  on,  61,  62,  377-382 
effect   of   grouping   or   rounding   on   test   of,    a 
known,  196-198 
a  unknown,  200-203,  214,  215 
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Differences  between  means,  F-test  of,  385,  386 
number  of  observations  needed  for  test  of,  a 
known,  61,  62 
<r  unknown,  71-73,  81,  82,  150 
one-sided  test,  <r  known,  sequential,  423 
single  sample,  61,  62 
a  unknown,  sequential,  71,  83-85,  426 
single  sample,  71,  72 
sequential  test  of,  a  known,  423 
a  unknown,  71,  72,  83-85,  426 
signs  of,  basis  for  decisions,  377-382 
i-test  of,  sequential,  71,  72,  83-85,  426 

single  sample,  71,  72,  81,  82,  150 
two-sided  test,  <r  known,  61,  62,  426 

a  unknown,  150,  426 
yardsticks  for,  150 
Differences  between  proportions  or  percentages, 
acceptance    criteria   for   non-significance    of, 
251-255,  263,  264,  424,  425 
angular  representation  of,  262,  264 
design  of  experiments  for  testing,  258-265,  424, 

425 
number  of  observations  needed  for  test  of,  262- 

264 
sequential  test  for,  424,  425 

single-sample   test,   approximate   methods   for, 
249,  250,  253-257,  263 
"exact"  method,  250-253 
Fulcher-Zubin  Item  Analyzer,  257,  258 
incorrect  method,  256 

(See  also  Binomial  distributions,  experiments 
for  comparing  parameters  of  two) 
Digits,  significant,  189n.,  190n. 
Discontinuities  of  sampling  distributions,  result- 
ing from  grouping  data,  203-218 
Discriminant  function,  best,  for  discriminating  be- 
tween two  populations  of  known  form,  381, 
382 
Discriminating  power  of  experiment,  269 
Dispersion,    measures    of    (see    Mean    deviation; 

Range;  Standard  deviation) 
Distribution,  of  adjusted  value  of  chi  (xc),  255 
of  amount  of  inspection,  with  double,  multiple, 
and  sequential  sampling,  235-244 
computational  methods  for  determining,  244 
figures  showing,  241,  243 

nature  of,  in  repeated  sequential  sampling, 
428 
of  angular  difference  between  two  proportions 

or  percentages,  263,  264 
of  arithmetic  mean,  197,  198 

when  data  are  grouped,   196-198,  203,   212, 
213 
of  average  error,  162,  163 
of  Bartlett's  test  for  homogeneity  of  variance 

estimates,  387,  388 
beta  function,  incomplete,  333,  334,  338,  392- 

394 
binomial,  250,  251,  421,  427 
bivariate  normal,  146 
of  chi,  corrected  for  continuity  (x<0,  255 
of  chi-squared  (x2),  formula  for,  270 

percentage  points  of,  108-110,  258n.,  315n. 
relation  to  F-distribution,  316 


Distribution,    of    chi-squared    (x2),    relation    to 
normal  distribution.  258n. 
to  s2-distribution,  270,  315 
to  ^-distribution,  155 
of  Cochran's  test  for  significance  of  largest  vari- 
ance estimate,  389,  392,  393 
of  correlation  coefficient,  413 
cumulative,  339-352 
Error  (see  Normal  distribution) 
of  error  of  an  interpolated  value,  221 
of  F,  formula  for,  281 

relation  to  chi-squared  distribution,  316 

to  incomplete  beta  function,  392,  393 
significance  levels  of,  394 

detailed  references  to,  3l7n. 
Paulson's  approximation,  393,  394 
when  data  are  rounded,  214,  215 
Gaussian  (see  Normal  distribution) 
generalized  normal,  197 

(See  also  Gram-Charlier  series) 
of  generalized  Student  ratio  (T),  155 
of  Hotelling's  T,  155 
J-shaped,  194,  222 
kurtosis  of  a,  measures  of,  197,  198 
of  mean  (see  Distribution  of  arithmetic  mean) 
of  mean  deviation,  162,  163 
of  median,  162 

moments  of,  about  origin,  193 
about  true  mean,  195 
centroidal,  195 

effect  of  grouping  or  rounding  on,  193-196 
ideal,  193 
realistic,  193 
of  N,  in  double,  multiple,  and  sequential  sam- 
pling, 235-244 
computational  methods  for  determining,  244 
figures  showing,  241,  243 

nature  of,  in  repeated  sequential  sampling, 
428 
non-central  t,  8n.,  22,  64,  71-82 

use  in  variables  inspection  for  fraction  defec- 
tive, 71-82 
non-normal,  99,  381 
normal,  figure  showing,  10 
formula  for,  161 
(See  also  Normal  distribution) 
Poisson,  411,  412,  426 

relation  to  x2  distribution,  109 
of  proportions   or  percentages,   250,   251,   263, 
264,  397 
(See  also  Binomial  distribution) 
of  r,  413 

of  range,  229,  230,  234 
rectangular,  195,  214,  215 
of  s  or  s2,  58,  199 

when  data  are  grouped  or  rounded,  198-200, 

203,  204 
relation  to  x2  distribution,  270,  315 
semi-invariants  of,  196n. 
skewness  of,  measures  of,  197,  198 
of  Student's  t,  201,  202 

when   data    are   grouped    or    rounded,    200, 
203,  214,  215 
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Distribution,  of  Student's  z,  187,  188,  200 
of  T,  154.  155 

relation  to  x2  distribution,  155 
relation  to  F-distribution,  155 
of  !F-statistics(  152-155 
of  *,  72,  148 

effect  of  grouping  or  rounding  on,  200-203, 

214,  215 
non-central,  8n.,  22,  64,  71-82 
percentage  points  of,  148,  148n. 
of  variance  ratio  (F),  formula  for,  281 
relation  to  x2-distribution,  316 

to  incomplete  beta  function,  392,  393 
significance  levels  of,  394 
detailed  references  to,  3l7n. 
Paulson's  approximation,  393,  394 
when  data  are  grouped  or  rounded,  214-215 
of  x,  197,  198 

when  data  are  grouped  or  rounded,  196,  198, 
203,  212,  213 
Division  of  War  Research,  Columbia  University, 

vii 
Dodge,  Harold  F.,  8n.,  93 
Dosage-mortality  curve,  342,  343n.,  352 

design  of  experiments  for  determining,  339-352 
Double  dichotomy,  analysis  of  data  in  form  of, 
sequential,  424,  425 
single  sample,  250-258 

approximate  method  (xc),  253-255 
"exact"  method,  250-253 
design  of  experiments  leading  to,  258-265 
number  of  trials  needed,  258-261 
formula  for,  259,  260 
nomogram  for,  260 
operating  characteristics  (OC)  of  single-sample 

test,  261 
OC  curve,  figure  for,  263 

formula  for,  262 
OC  surface,  261,  262 
sequential  analysis  applied  to,  424,  425 
Double  sampling,  attributes  inspection  for  frac- 
tion defective,  237-244 
average    amount    of    inspection,    compared    to 
multiple  and  sequential  sampling,  238-249 
computational  methods  for,  references  to,  244 
figures  showing,  240,  242 
characterized  by  relation  of  rejection  number 

(Rk)  to  acceptance  number  (Ak),  237 
variability  of  amount  of  inspection  [<r(N)]t  com- 
pared to  multiple  and  sequential  sampling, 
238-244 
computational  methods,  references  to,  244 
figures  showing,  241,  243 
Dwyer,  Paul  S.,  172n.,  183 


Economic  time  series,  177 

Efficiency,   of  sample   statistics  as  estimators  of 
population  parameters,  227n. 
arithmetic  mean  as  estimator  of  /z,  162 

when  data  are  grouped  or  rounded,  197 
mean  deviation  as  estimator  of  a,  163 


Efficiency,  of   sample  statistics  as   estimators  of 
population    parameters,    median    as    esti- 
mator of  fj.,  in  normal  samples,  162 
range  as  estimator  of  a,  227 
s  as  estimator  of  a,  162,  227 
of  sampling-inspection   plans   and   procedures, 
attributes  versus  variables,  26-28 
double  sampling,  versus  multiple  and  sequen- 
tial, 237-244 
multiple  sampling,  versus  double  and  sequen- 
tial, 237-244 
sequential  sampling,   versus  non-sequential, 

26,  85,  237-244,  419,  420 
single   sampling,    versus   sequential,    26,    85, 
419,  420 
of  statistical  tools,  386m,  394 
Eisenhart,  Churchill,  iii,  ix-xii,  186,  268,  376,  384, 

396 
Elderton,  W.  Palin,  194n.,  222 
Enumeration  data  (see  Attributes;  Binomial  dis- 
tribution; Poisson  distribution) 
e:  probability  of  acceptance,  59 
e:  probability  that  a  normal  deviate  will  exceed 

K6,  17 
Error,  average,  162 

(See  also  Mean  deviation) 
components  of,  269,  297,  310-314 
of  first  kind,  271,  281,  282 
grouping  of,  187-189,  195,  218,  222 
of  interpolated  value,  218,  219,  222 
of  measurement,  188,  189 
probable,  187n.,  223 
random,  269,  297,  310 
of  second  kind,  271,  281,  282 
standard  (see  Standard  deviation) 
systematic,  222,  310 
of  tabular  value,  215-218,  221 
Estimate,  statistical  use  of  the  term,  12 
Estimation,  abandoned  prior  to  completion,  355, 
356 
by  confidence  intervals,  42,  43,  99,  100,  332-336 
effect  of  grouping  or  rounding  on,  192,  196- 
203,  213-215 
of  fraction  defective,  by  attributes  inspection, 
332-336 
by  variables  inspection,  42,  43,  58 
of  maximum  of  a  function,  363-372 
of  m  +  frf)  by  x  +  Jcs,  58 

of  population  mean  (fi),  from  probit  data  (ob- 
servations on  cumulative  distribution),  343, 
350 
by  sample  mean,  12,  58,  161,  162,  192 

effect  of  grouping  or  rounding  on,  187,  188, 
196-198,  200,  203,  212,  213 
by  sample  median,  161,  162 
from  sensitivity  data,  343,  350 
of   population   proportion    (P),    by    confidence 
belts,  332-336 
by  sample  proportion  (p),  397,  410 
from  transformed  proportion  (<p),  403,  404 
of  population  standard  deviation  (a),  from  pro- 
bit  data  (observations  on  cumulative  dis- 
tribution), 343,  350 
from  mean  deviation,  162,  163,  169 
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Estimation  of  population  standard  deviation  (a), 
from  s,  12,  58 
of  population  variance  (a2),  by  mean  deviation, 
162,  163,  169 
by  s»,  162 

when  data  are  grouped  or  rounded,   198- 
200,  212-214 
sequential,  reference  to,  238n. 
of  slope  and  intercept  of  regression  line,  when 
cost  of  each  measurement  depends  upon  its 
magnitude,  319-328 
when  probit  data  are  involved,  343 
of  statistical  tolerance  limits,  97-110 
"Exact"  method,  for  analyzing  2X2  table,  250- 

253 
Expected  value,  397,  408 

of  mean  deviation,  in  normal  samples,  169 
of  p,  in  binomial  samples,  397,  410 
of  <p(j>)  and  vB(p),  403,  404 

tables  giving,  400-402 
of  range,  in  normal  samples,  227n. 
of  s,  in  normal  samples,  58 
of  s2,  192,  412 

when  data  are  grouped  or  rounded,  199,  212, 
213 
of  x,  arbitrary  population,  197,  198 

when  data  are  grouped  or  rounded,  197,  198 
of  x  +  ks,  58 
Experiment  designs,  factorial,  356,  366,  367,  369- 
372 
definition  of,  366n. 

problems  for  which  ideally  suited,  366 
sequential,  417-429 

for  determining  maximum  of  a  function,  367- 

372 
when  failure  rate  of  equipment  is  high,  353- 

362 
(See    also    Sequential    analysis;    Sequential- 
sampling-inspection  plans) 
Experimental    method,    compared    to    standard 

method,  250 
Experimentation,  when  cost  of  each  measurement 
depends  upon  its  magnitude,  319-328 
when  failure  rate  of  equipment  is  high,  355,  356 
when  measurements  are  observations  on  cumu- 
lative distribution,  339-352 
role  of  randomization  in,  141,  142,  160 
role  of  replication  in,  159 

when  severity  of  any  one  test  cannot  be  con- 
trolled precisely,  331-338 
(See  also  Design  of  experiments;  Experiment 
designs;  Experiments) 
Experiments,    abandoning    prior    to    completion, 
provision  for,  355,  356 
for  ammunition-proofing  of  gun  barrels,  331-338 
metallurgical,  321 

(See  also  Design  of  Experiments;  Experimen- 
tation; Experiment  designs) 
Extreme  observations,  234 


F:  the  variance  ratio,  or  any  random  variable 
having  the  variance-ratio  distribution,  151, 
152,  155,  281 


F:  the  variance  ratio,  analysis  of  variance,  uses 
in,  311-314 
definition  of,  customary,  ambiguity  of,  151m. 
general,  281 

for  grouped  or  rounded  data,  215 
degrees  of  freedom  (ni.mim, vi),  ambiguity  re- 
garding, 151n. 
unambiguous  specification  of,  281 
distribution  of,  formula  for,  281 

percentage  points,  Paulson's  approximation, 
393,  394 
relation  to  percentage  points  of  x2-dis- 
tribution,  316 
Fisher's  z,  317n. 

incomplete  beta  function,  392,  393 
tables  of,  references,  151n.,  184,  3l7n.,  318 
comments  on,  151n.,  152n.,  317n. 
distribution  of  log  F,  when  degrees  of  freedom 

are  equal,  152 
e2*,  3l7n. 
formulas  for,  general,  281 

for  grouped  or  rounded  data,  215 
special,  151,  169 
relationship  to  x2,  215,  281,  316 
to  Fisher's  z,  317n. 
of  Hotelling's  T  to,  155 
to  incomplete  beta  function,  392,  393 
significance  levels  of,  Paulson's  approximation, 
393,  394 
relation  to  significance  levels  of  x2-distribu- 
tion,  316 
of  incomplete  beta  function,  392,  393 
tables   of,   references   to,    151n.,    184,    317m., 
318n. 
comments  on,  151n.,  152n.,  317n. 
test  of  homogeneity  of  sample  means,  385,  386 
compared  to  Bartlett's  test  for  homogeneity 

of  variance  estimates,  386 
discriminating  power  of,  improved  by  parti- 
tioning comparisons,  386 
as  "omnibus"  test,  386 
test  of  ratio  of  two  variance  estimates,  280-318 
use  in  planning  and  interpreting   experiments 
for  comparing  two  standard  deviation  or 
variance  estimates,  280,  283,  296 
for    detecting    presence    of    components    of 

variance,  311-314 
for  determining  whether  one  process  (prod- 
uct)  is  more  variable  than  another,  280- 
283, 296 
.F(N) :  any  function  of  N  that  is  independent  of  a, 

61 
/:  degrees  of  freedom  of  chi-squared  (x2),  75 
Factorial  design,  of  experiments,  356,  366,  367, 
369-372 
definition  of,  366n. 
situations  for  which  ideal,  366 
Factorials,  in   "exact"   analysis   of  2  X  2   table, 
251-253 
logarithms  of,  reference  to  tables  of,  253n. 
Failure  of  equipment,   allowance  for  rate  of,  in 

planning  experiments,  355-362 
Ferris,  C.  D.,  318n. 
Fiducial  limits  (see  Confidence  limits) 
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Figures,  list  of  principal,  xiv 
Fire-control  devices,  249 

Fisher,  Ronald  A.,  18,  93,  109,  116,  120n.,  148??., 
151,    152rc.,    159n.t    163n.,    167,    I77n.,     183, 
192??.,    197,   200,   201,   203,   211n.,   212,   218, 
219n.,   220,   221,   222,   232,   234,   250n.,   255, 
258n.,    265,    270n.,    297,    315n.,    317n..,    318, 
325n.,  328,  342??.,  343n.,  352,  366n.,  372,  398, 
399,  400,  405,  407,  413,  414n.,  415 
Fisher-Cornish  type  of  expansion,  109 
Five-point  OC  curve,  28-30,  427 
Flatness  of  probability  distribution  (see  Kurtosis) 
Forman,  H.  Irving,  x 

Formula,  for  A  (adjustment  constant),  41,  54,  70, 
71 
A  (critical  value  of  likelihood  ratio),  420 
A   (critical  value  of  normally  distributed  test 

function),  62 
A  (function  in  formulas  for  sequential  variables 

plan  for  fraction  defective),  84 
Ah,  in  terms  of  Rk,  237 
o  (critical  value  of  X  in  sequential  analysis),  84 

420 
o  (multiple  of  a  in  formula  for  mean  of  s),  58 
acceptance   criterion,   in   analysis   of   variance, 
312,  313 
for  difference  of  means,  61,  62 
for  difference  of  proportions,  263,  264 
general,  in  sequential  analysis,  420 
with  (large)  single  sample,  61,  62 
for  mean,  in  sequential  test,  423 

in  single-sample  test,  61,  62 
for  proportion,  in  sequential  test,  422 

in  single-sample  test,  260n.,  264 
when  range  in  samples  of  k  determines  lot 

quality  with  single  sample,  229,  232 
for  ratio  of  two  variance  estimates,  281,  283 
for  s2,  in  single-sample  test,  232,  270,  279 
in  variables  inspection  for  fraction  defective, 
12,  33-35,  37,  41,  69-71 
acceptance  line,  for  number  of  "successes,"  in 
binomial  test,  422 
for    number    of    "success-failure"    pairs,    in 

double-dichotomy  test,  425 
for  sum  of  observations,  in  test  for  n,  with  a 

known,  423 
for  sum  of  squared  deviations,  in  test  for  a, 
with  ix  unknown,  232 
adjusted  value  of  chi  (x<0,  254,  257 
adjustment  constant  (A),  41,  54,  70,  71 
adjustment  of  k,  N  fixed,  19,  20 
amount  of  inspection,  attributes  plan,  sequen- 
tial sampling,  427,  428 
single  sample,  259,  260n.,  263,  264 
binomial  test,  one-sided,  sequential  sampling, 
427,  428 
single  sample,  259,  260n.,  263,  264 
when  decisions  are  based  on  signs  of  differ- 
ences between  sample  means,  380 
difference  of  means  test,  a  known,  61,  62 

a  unknown,  17,  27n.,  28??.,  60-66,  81,  82 
difference  of  proportions  test,  259,  260,  263, 
264 


Formula,  amount  of  inspection,  double-dichotomy 
test,  259,  260,  263,  264 
fraction  defective  test  based  on  variables,  17 
when   ll  is    known   and  <r   unknown,    27??., 

28n.,  231,  278n. 
when   fx  is   unknown   and  <r  known,   27??., 

28n.,  62 
when  fx  and  a  are  both  unknown,  17,  27n., 
28n.,  60-63 
general  one-parameter  sequential  tests,  427 

single-sample  tests,  61,  62 
mean  test,  a  known,  27??..,  28n.,  62 

a  unknown,  17,  27??.,  28n.,  60-66,  81,  82 
proportion  test,  sequential  sampling,  427,  428 

single  sample,  259,  260n.,  263-264 
when  range  in  samples  of  A;  determines  lot 
quality,  sequential  sampling,  233 
single  sample,  231,  278n. 
standard  deviation  test,  sequential  sampling, 
233 
single  sample,  231,  278?i. 
variables  inspection  plans  for  fraction  defec- 
tive, 17 
when  ix  is  known  and  <r  is  unknown,  27??., 

28n.,  231,  278n. 
when  ix  is  unknown  and  <r  is  known,  27??.. 

28n.,  62 
when  ix  and  <r  are  both  unknown,  17,  27n., 
28??.,  60-63 
analysis-of- variance  table,  entries  of,  312 
angular  difference,  between  two  binomial  proba- 
bilities, 262,  264 
between  two  observed  proportions,  264 
angular  transformation,  258,  398,  399 

modified,  401 
arithmetic  mean,  12,  100,  101,  192 
asymptotic,  for  confidence  limits  for  population 
proportion,  336 
for  percentage  points  of  T-distribution,  155 
for  percentage  points  of  i-distribution,  148 
for  tolerance-limit  factor  K,  101 
ASN  curve,  for  general  one-parameter  sequen- 
tial tests,  427 
for  sequential  one-sided  binomial  test,    427, 
428 
with  acceptance  line  omitted,  358 
for  sequential  one-sided  test  of  <r,  233 
b  (critical  value  of  —X  in  sequential  analysis), 

84,  421 
b  (function  that  appears  in  formula  for  k  when 

JVis  fixed),  20,  59,  61 
b    (multiple    of   <r/-\/2(N  —  1  )    in   formula   for 

standard  deviation  of  s),  58 
Bartlett's    adjustment,    of    inverse-sine    trans- 
formation, 399 
Bartlett's  test  of  homogeneity  of  variance  esti- 
mates, 387,  388 
beta  function,  incomplete,  392 
binomial  distribution,  251 

boundaries  of  acceptance  and  rejection  regions, 
sequential  binomial  test,  422 
sequential  double-dichotomy  test,  425 
sequential  mean  test,  a  known,  423 
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Formula,  boundaries  of  acceptance  and  rejection 
regions,  sequential  procedure  for  determin- 
ing  whether   to   complete   or   abandon   an 
experiment,  356-361 
sequential    standard    deviation    test,    m    un- 
known, 232,  233 
single-sample  variables  inspection  plans  for 
fraction  defective,  35-37,  41,  53,  54,  56,  57, 
67-71 
c2,  58 
Xc  254 

X2,  254,  256,  257 
x2-distribution,  270 

Cochran's  test  for  significance  of  largest  vari- 
ance estimate,  388 
approximate  significance  levels  of,  392 
confidence  limits,  for  parameter  of  binomial  dis- 
tribution, 335,  336 
control  limits,  for  means,  147-148 
correlation  coefficient,  transformation  of,  413 
critical  level  (A)   of  normally  distributed  test 

function,  62 
degrees  of  freedom,  in  analysis  of  variance,  312 
of  T,  155 

of  T-statistics,  145,  152-154 
5  (f,to,e),  74 
for    estimating    variance    of    linear    regression 

fitted  by  weighted  least  squares,  343 
expected  frequencies,  in  2  X  2  table,  253 
expected  value,  of  mean  deviation,  in  normal 
samples,  169 
of  p,  in  binomial  samples,  397,  410 
of  s,  in  normal  samples,  58 
of  s2,  arbitrary  population,  197,  198 

when  data  are  grouped  or  rounded,  199 
of  x,  197,  198 
of  x  +  ks,  58 
F,  general,  281 

for  grouped  or  rounded  data,  215 
special,  151,  169 

71,  198 

72,  198 

general    transformation    to    stabilize    variance, 

409 
generalized  Student  ratio  (T),  154,  155 
hi  and  hi,  232,  233,  356,  357,  422,  423,  425 
Hotelling's  T,  154,  155 
interpolated  value,  error  of,  219 

variance  of,  219,  220 
interpolation  without  differences,  219 
inverse-sine  transformation,  258,  398,  399 

modified,  410 
K,  101,  108 
Ka,  263 
K0,  263,  264 
Ke,  17,  18,  31,  260 
k,  17,  19,  59,  60,  63,  81,  82 
kurtosis,  measures  of,  197,  198 
LP,  30,  51 
Ltr,  232,  233 
Lt,  262 

Lagrangian  interpolation  coefficients,  219 
to,  350 
mean  (see  Formula,  arithmetic  mean) 


Formula    moments,  of  arbitrary  distribution,  193 
when  data  are  grouped  or  rounded,  193,  195 
average,  194 
of  distribution  of  s2,  grouped  and  ungrouped 
data,  199 
of  /,  grouped  and  ungrouped  data,  199 
of  x,  grouped  and  ungrouped  data,  198 
N  (number  of  observations  needed  in  each  of 

two  samples),  259,  264,  380 
N    (number   of   observations   needed   in   single 
sample),  17,  26,  28,  60-62,  72,  73,  81,  231, 
259,  260n.,  264,  278n. 
N,  233,  427,  428 
Na,  26,  259,  260n.,  264 
Nm,  28n.,  61,  62 
N>,  28».,  231,  278n. 
Nt,  17,  28n.,  60,  61,  81 
non-central  t,  72,  73 
normal  distribution,  17,  18,  31,  161,  260,  263. 

423 
number  of  observations  needed,  with  attributes 
inspection,  sequential  sampling,  427,  428 
single  sample,  259,  260n.,  264 
binomial  test,  one-sided,  sequential  sampling, 
427,  428 
single  sample,  259,  260n.,  264 
X2  test  of  variance  estimate,  231,  278n. 
when  decisions  are  based  on  signs  of  differ- 
ences between  sample  means,  380 
for  determining  whether  level  of  performance 
of  a  process  (product)  exceeds  standard, 
17,  28n.,  60-62,  81 
whether   variability   of   performance   of    a 
process  (product)  exceeds  standard,  231, 
278n. 
difference   of   means   test,    single   sample,    a 
known,  61,  62 
a  unknown,  17,  28n.,  60,  61,  81 
difference    of    proportions    test,     sequential 
sampling,  424,  427,  428 
single  sample,  259,  260,  263,  264 
double-dichotomy  test,  sequential  sampling, 
424,  427,  428 
single  sample,  259,  260,  263,  264 
fraction  defective  test  based  on  variables,  17 
H  known,  cr  unknown,  28n.,  231,  278n. 
/u  unknown,  a  known,  28rc.,  61,  62 
fj.  and  a  unknown,  17,  28,  60,  61,  81 
general     one-parameter     tests,      sequential 
sampling,  427 
single  sample,  61,  62 
mean  test,  single  sample,  a  known,  28n.,  61, 
62 
a  unknown,  17,  28n.,  60,  61,  81 
proportion  test,  sequential  sampling,  427,  428 

single  sample,  259,  260n.,  264 
when  range  in  samples  of  A;  determines  lot 
quality,  sequential  sampling,  233 
single  sample,  231,  278n. 
for  setting  tolerance  limits,  references  to,  109, 

110 
standard  deviation  test,  sequential  sampling, 
233 
single  sampling,  231,  278n. 
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Formula,   number  of   observations  needed,  vari- 
ables inspection  plans  for  fraction  defec- 
tive, 17 
H  known,  a  unknown,  28n.,  231,  278n. 
H  unknown,  <r  known,  28n.,  61,  62 
H  and  a  unknown,  17,  28rc.,  60,  61,  81 
OC  curve,  for  analysis-of-variance  tests,  313 
attributes  inspection,  single  sample,  260n.,  262 
binomial  test,  one-decision,  sequential,  358 

one-sided,  single  sample,  260n.,  262 
X2  test  of  variance  estimate,  231,  232,  271, 

278n.,  279,  315 
when  decisions  are  based  on  signs  of  differ- 
ences between  sample  means,  379 
difference  of  means  test,  a  known,  61-62 

a  unknown,  30,  59-61,  62,  63,  72,  73 
difference  of  proportions  test,  single  sample, 

262 
double-dichotomy  test,  single  sample,  262 
experiments  for  determining  whether  level  of 
performance  of  one  process  or  product 
exceeds  standard,  30,  59-63,  72,  73 
whether    variability    of    one    process     or 
product  exceeds  standard,  sequential 
sampling,  233 
single  sample,  231,  232,  271,  278n.,   279, 
315 
whether    variability    of     one    process    or 
product  exceeds  variability  of  another, 
282,  316 
F-tests  of  variance  estimates,  282,  313,  316 
fraction   defective   test   based   on   variables, 

30,  59-63,  72,  73 
general   one-parameter  tests,   single  sample, 

61,  62 
mean  test,  single  sample,  <r  known,  61,  62 

a  unknown,  30,  59-61,  62,  63,  71-73,  76,  77 
one-decision  sequential  binomial  test,  358 
proportion  test,  single  sample,  260n.,  262 
when  range  in  samples  of  k  determines  lot 
quality,  sequential  sampling,  233 
single  sample,  231,  232,  271,  278n.,  279,  315 
standard  deviation  tests,  231,  232,  271,  278n., 

279,  282,  313,  315,  316 
variables   inspection   for   fraction   defective, 
30,  59-63,  72,  73 
<p  and  <pb,  399,  401 
<t>  (a,/3,ni,n2),  282,  316 

probability  that  sample  means  will  be  in  oppo- 
site order  to  population  means,  379 
probit,  variance  of  a,  350 

RP,  for  one-decision  sequential  binomial  test,  358 
r,  transformation  of,  413 
p  (ff,/9,n),  231,  232,  271,  278n.,  279,  315 
s  (slope  of  decision  line,  in  graphical  procedure 
for  sequential  tests),  233,  358,  422,  423,  425 
«  or  s2,  12,  43,  162,  192 

sample  size   (see  Formula,  number  of  observa- 
tions needed) 
significance  levels  of  Cochran's  test,  392 
of  T-distribution,  155 
of  ^-distribution,  148 
standard    deviation,    of    observed    proportion, 
255,  397 


101,  108 

and   a   from    probit 


Formula,  standard  deviation,  of  mean,  58 
of  s,  in  normal  samples,  58 
of  6,  258 
of  x,  58 

of  x  +  ks,  in  normal  samples,  58 
(See  also  Formula,  variance) 
Student's  t,  200 
Student's  z,  187n. 
T    (angular   difference   between   two   binomial 

probabilities),  262,  264 
T  (Hotelling's),  142,  154 
71-statistics,  131,  133,  135,  145 
t  (angular  difference  between  two  observed  pro- 
portions), 264 
t  (Student's),  200 

non-central,  72,  73 
t  (f,S,e),  74 

tolerance-limit  factor  (K), 
two-point   estimates   of   /* 
data,  350 
variances  of,  351,  352 
transformation  to  stabilize  variance,  409 

(See    also     Inverse-sine    transformation;    z- 
transformation) 
variance,    of    angular    difference    between    two 
observed  proportions,  264 
arithmetic  mean,  162 

when  data  are  grouped  or  rounded,  198 
binomial  distribution,  397,  410 
correlation  coefficient,  413 
"error"   due  to  grouping  or  rounding,   195, 

218 
grouped  or  rounded  variable,  195 
interpolated  value,  219 
mean,  162 

when  data  are  grouped  or  rounded,  198 
median,  162 
<p  and  ipb,  400,  407 
Poisson  distribution,  411 
probit,  350 
proportion,  397,  410 
r,  413 
s2,  arbitrary  population,  199 

when  data  are  grouped  or  rounded,  199 
t  (angular  difference  between  two  observed 

proportions),  264 
0  (transformed  proportion),  400 
two-point  estimates  of  n  and  a  from  probit 

data,  351,  352 
x,  162,  198 
variance  estimates,  12,  43,  162,  192 
x,  12,  43,  100,  192 

yardsticks,  for  differences  of  means,  150 
Fourfold  table  (see  Double  dichotomy) 
Fraction  defective,  definition  of,  10 

dependence  upon  fi  and  <r  of  items  in  lot,  10 
as  measure  of  lot  quality,  11,  42 

(See  also  Acceptance  in  pection,  for  fraction 
defective) 
Frankel,  Lester,  149n.,  155n.,  184 
Freedom,  degrees  of  (see  Degrees  of  freedom) 
Freeman,  Harold  A.,  ix,  419n.,  428 
Frequencies,  expected,  in  2  X  2  table,  253 
Frequency  distributions  (see  Distribution) 
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Friedman,   Milton,  ix,  xi,  xii,   6,   320,   340,   364, 

419n. 
Fry,  Thornton  C,  253n.,  265 
Fulcher,  John  S.,  257n.,  265 
Fulcher-Zubin  Item  Analyzer,  257,  258 
Function,  experimental  determination  of 

maximum  of,  363-372 


T:    confidence    coefficient   associated    with   confi- 
dence belts  for  parameter  of  binomial  distri- 
bution, 335,  336 
y.  a  confidence  coefficient,  42,  43,  98,   100,   108, 

332,  336 
yc:  confidence  coefficient  used  in  judging  whether 

to  continue  normal  inspection,  44 
yr:  confidence  coefficient  used  in  judging  whether 

to  resume  normal  inspection,  44 
71:  Fisher's  measure  of  skewness,  197,  198 
72:  Fisher's  measure  of  kurtosis,  197,  198 
Gamma  distribution  (see  Chi-squared  distribution) 
Gaussian  distribution  (see  Normal  distribution) 
Geary,  R.  C.,  201,  222 

General  one-parameter  tests,  sequential  sampling,, 
426-428 
single  sampling,  61,  62 
General  transformation  to  stabilize  variance,  409 
Generalized  normal  curve,  197 

(See  also  Gram-Charlier  series) 
Generalized  Student's  ratio  (T),  definition  of,  142, 
154,  155 
distribution  of,  154,  155 

asymptotic  expansion  for  percentage  points 

of,  155 
free  from  nuisance  parameters,  115 
relation  to  x2-distribution,  155 
to  F-distribution,  155 
optimum  properties  of,  147 

as  "score,"  combining  results  of  twd  or  more 
tests,  113-116 
Generalized  variance,  166 
Girshick,  Meyer  A.,  ix,  xi,  xii,  6,  226,  238n.,  244, 

354,  419 
Glaisher,  J.  W.  L.,  191,  222 
Glossary  for  Chap.  1,  85-92 
Goldberg,  Henry,  x,  83n.,  109,  149n.,  184 
Gosset,  William  Sealy,  187 

(*See  also  Student) 
Gram-Charlier  series,  197 

Graphical  procedure,  for  analyzing  binomial  data, 
260-262 
binomial-probability  paper,  reference  to,  414 
nomogram  for,  260 
Zubin's,  398,  416 
for  determining  whether  to  abandon  or  com- 
plete an  experiment,  356-359 
worked  example,  359-361 
in  sequential  analysis,  binomial  test,  one-deci- 
sion, 358 
one-sided,  422 
comparison  of  two  proportions,  425 
double-dichotomy  test,  425 
mean  test,  a  known,  423 
proportion  test,  422 


Graphical     procedure      in     sequential     analysis, 
standard  deviation  test,  fi  unknown,  232,  233 
in   variables  inspection  for  fraction  defective, 
34-37,  41 
adjustment  constant  (A),  36,  37,  40,  69-71 

optimum  choice  of,  41,  54,  70,  71 
advantages  of,  36 

boundaries   of  acceptance   and   rejection  re- 
gions, figures  showing,  38-40 
formulas  for,  35-37,  41 
derivation  of,  66—71 
impossible  region,  indicating  a  mistake,  36,  37 

figures  showing,  38-40 
when   upper   and   lower   limits    on    a   single 
quality  characteristic  define   acceptable 
quality,  52-56 
one  two-sided  criterion,  54-57 
pair  of  one-sided  criteria,  53,  54 
Grouped  data,  impossibility  of  avoiding  use  of, 

187,  188,  189 
Grouping  (of  data),  187,  188,  191-215 

effect  of,  on  descriptive  statistics,  187,  188 
on  distribution  of  F,  214,  215 
of  s2,  198-200,  203-214 
of  t,  200-203,  214,  215 
of  x,  196-198,  212,  213 
on  estimation,  of  population  mean,  197,  198, 
200-203 
population  variance,  198-200,  203-214 
on  moments,  193-196 

representation  as  effect  of  a  random  group- 
ing or  rounding  error,  195,  218 
on  standard  procedures   of  statistical  infer- 
ence, 191-215 
with  large  samples,  191-203 
with  small  samples,  203-215 
impossibility  of  avoiding,  187,  188,  189 
reasons  for,  187 
Grouping  interval,  recommended  choice  of  size  of, 

192,  214 
Grouping  lattice,  effect  of  position  of,  on  distri- 
bution of  s2,  206-213 
on  distribution  of  x,  212,  213 
on  estimation  of  ix  and  <r2  by  x  and  s2,  respec- 
tively, 212,  213 
random  location  of,  194 

extends   validity   of   Sheppard's   corrections, 

196 
when    only    non-negative    observations    can 
occur,  196' 
Groups,  variation  within  and  between,  312 
Grubbs,  F.  E.,  318n. 
Gun  barrels,  pressure  proofing  of,  331-338 

H 

h:  number  of  lots  from  which  samples  are  taken 

and  combined  for  computing  process  average, 

42 
hi:   negative   of  intercept   of  lower   decision  line 

(acceptance  line)  in  graphical  procedure  for 

sequential  tests,  426 
formula,  for  binomial  test,  one-sided,  422 

double-dichotomy  test,  425 

mean  test,  a  known,  423 
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hi:  negative  of  intercept  of  lower  decision  line 

(acceptance  line)  in  graphical  procedure  for 

sequential  tests,  formula,  standard  deviation 

test,  n  unknown,  233 

taken  equal  to  infinity,  to  provide  for  rejection 

only,  358 
use  in  constructing  five-point  OC  curve,  427 
h%:    intercept    of    upper    line    (rejection    line)    in 
graphical  procedure  for  sequential  tests,  426 
formula  for,  binomial  test,  one-decision,  358 
one-sided,  422 
double-dichotomy  test,  425 
mean  test,  a  known,  423 
standard  deviation  test,  n  unknown,  233 
use  in  constructing  five-point  OC  curve,  427 
Hall,  Philip,  218n.,  222 
Hartley,  H.  O.,  386n.,  394 
Hastay,  Millard  W.,  hi,  ix 
Hayashi,  Keiichi,  402n.,  416 
Histogram,  representation  of  population  to  which 

grouped  data  correspond,  193-195 
Homogeneity   of   binomial   distributions,    experi- 
ments for  testing,  249-265,  424,  425 
analysis  of  data  from,  sequential  sampling, 
424-425 
single  sample,  249-258 
planning  of,  sequential,  424 
single  sample,  258-265 
of  means,  61-62,  385-386 
of  proportions,  249-265,  424,  425 
of  2  X  2  table,  249-265 
of  variance  estimates,  385-389 

Bartlett's  test  for,  385,  386,  387,  388 
Cochran's  test  for,  386,  387,  388,  389 
significance  levels  of,  tables  of,  390,  391 
Homogeneous  variance,  advantages  of,  397,  398 
tests  for,  385-389 

transformation    of    data    to    achieve,    general 
formula  for,  408,  409 
examples,  410-413 
Hotelling,  Harold,  ix,  xi,  112,  114n.,  149n.,  154n., 
155n.,  172«.,  184,  365w.,  368n.,  371n.,  372,  398 
Hotelling's  T,  definition  of,  142,  154,  155 
distribution  of,  154,  155 

asymptotic  expansion  for  percentage  points 

of,  155 
free  from  nuisance  parameters,  115 
relation  to  x2-distribution,  155 
to  .^-distribution,  155 
optimum  properties  of,  147 

as  "score,"  combining  results  of  two  or  more 
tests,  113-116 
Hsu,  P.  L.,  147,  166n.,  184 
Hypotheses,    statistical   tests    of    (see    Statistical 

tests) 
Hypothesis,  alternative,  419,  420 
null,  420 
under  test,  419,  420 


Ignition  current,  341 

Immediate  acceptance,  criterion  for,  in  variables 

inspection  for  fraction  defective,  34,  37,  41, 

67,  69,  70 


Impossible  region,  indicating  a  mistake,  36,  37 

figures  showing,  38-40 
Inconsistent  requirements  in  specifications,  50n. 
Independence  of  acceptance  tests,  51 
tables  for,  249-255 

(See  also  Double  dichotomy;  2X2  tables) 
of  x  and  s  in  samples  from  a  normal  population, 
200 
effect  of  grouping  or  rounding  on,  187,  201 
Indifference  quality,  definition  of,  20 

use  in' variables  inspection  for  fraction  defec- 
tive, 21 
Indifference  value,  282 
Inferences    about    population     (see    Estimation; 

Statistical  tests) 
Information  about  lot,  7 

Inspection,    acceptance    (see   Acceptance    inspec- 
tion) 
amount  of  (see  Amount  of  inspection) 
cost  of  (see  Cost;  Amount  of  inspection) 
curtailed,  14,  238,  361 
destructive,  9,  321,  331,  341 
normal,  43,  44 
plan,    considerations    involved    and    steps    in 

choosing,  45-49 
reduced,  41-45 
sampling,  effectiveness  of,  156-158 

(See  also  Operating-characteristic  curves) 
objections  of,  156-158 
risks  involved,  228,  229 

(See  also  Consumer's  risk;  Producer's  risk) 
value  of,  dependent  on  quality  of  submitted 
product,  42 
for  several  quality  characteristics,  50-52,  113— 

116 
tightened,  41-45 
variability  of,  235-244 

by  variables  (see  Variables,  inspection  by) 
Interpolated  value,  accuracy  of,  219,  220n.,  221n. 
error  of,  219 

precision  or  variance  of,  219-221 
Interpolation,  218-221 

coefficients,  Lagrangian,  108,  109,  219,  222 

Salzer's  inverse,  109 
without  differences,  219 
harmonic  in  k  or  n,  317,  388 

in  y/n,  275,  316,  388 
inverse,  109 

probability  wise,  316n.,  318 
Interpreting,     analysis-of-variance     experiments, 
297,  310-314 
experiments    for    comparing    two    proportions, 
249-258 
for  comparing  two  standard  deviations,  267- 

318 
for  determining  whether  variability  of  new 
process    or    product    exceeds    standard, 
267-278 
is  less  than  standard,  278-280 
for  determining  whether  one  of  two  processes 

is  more  variable  than  another,  280-296 

experiments  when  severity  of  any  one  test 

cannot  be  controlled  precisely,  331-338 
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Inverse  hyperbolic  tangent,  413  (see  e-transforma- 

tion) 
Inverse-sine    transformation,    of    proportions,    to 
stabilize    variance,    257-259,    261-264,    336, 
395-416 
adjustment  of,  399,  401-407 

references  to,  399n.,  405n.,  407n. 
table  to  facilitate,  406 
Bartlett's  adjustment  of,  399,  401-407,  413,  414 
Bartlett's  form  of,  398,  400,  401 
bias  of,  403,  404 
bibliography  for,  414-416 
derivation  of,  410,  411 
effectiveness  of,  401-408 
figures  showing,  400-402 
tables  showing,  403-405 
forms  of,  Bartlett's,  398,  400,  401 
Bliss',  398,  400 

Fisher's  original,  398,  399,  400 
SRG's  first  version,  405,  406 
Yates',  398,  400 
Zubin's,  257,  258,  398,  405 
history  of,  398 
nature  of,  399-400 
presentation  of  final  results,  407 
tables  to  facilitate  application  of  (see  Bibliog- 
raphy, 414-416) 
use  to  determine  confidence  limits  for  binomial 
parameter,  336 
to    determine    number    of    trials    needed    in 
experiments  for  comparing  two  propor- 
tions, 259,  260 
formula  for,  259 

derivation  of,  261-264 
nomogram  based  on,  260 
use  to  determine  OC  curve  of  experiment  for 

comparing  two  proportions,  261,  262 
use  in  sequential  analysis,  references  to,  405n., 
415 
Irwin,  J.  O.,  218n.,  222 
Isaacson,  E.,  416 


J-shaped  curves,  194,  222 

Jennett,  W.  J.,  8n.,  49,  58n.,  72n.,  93,  99 

quotation  from  paper  by,  with  B.  L.  Welch,  49- 
50 
Johnson,  N.  L.,  8n.,  22,  64,  72,  73,  74,  76,  77,  79, 
86,  93 


K:  difference  between  upper  limit  (U)  specified 

for  a  variable  and  its  population  mean  (n) ,  in 

units  of  population  standard  deviation   (a), 

9-13 

K:  minimum  proofing  pressure  prescribed  for  gun 

barrels,  331 
K:  multiple  of  s  used  in  setting  tolerance  limits, 
97-100 
formulas  for,  101,  108 
table  of  values  for,  102-107 
uses  of,  examples  of,  100,  101 


Ka-  normal  deviate  exceeded  with  probability  a 
(producer's  risk,  or  probability  of  an  error  of 
first  kind),  17,  278n. 
formula  for,  263 
relation  (of  Ka2)  to  percentage  points  of  x2-dis- 

tribution,  258n. 
use    in     acceptance    inspection,     for     average 
performance,  61,  62 
for  dispersion  of  performance,  231,  232,  278n. 
for    fraction    defective,    by    attributes,    259, 
260n.,  261-264 
by  variables,  17-21,  60-63,  81,  82 
for    fraction    of    sample    ranges    less    than 

specified  limit,  231,  232 
for  mean  quality  of  lot,  61,  62 
for  standard  deviation  of  quality  in  lot,  231, 
232,  278n. 
use  in  formula  for  critical  level  (A)  and  number 
of  observations  needed  in  general  single- 
sample,  one-parameter  test,  61,  62 
use  in  planning  experiments  for  comparing  two 
proportions,  258-264 
for   comparing   two   standard   deviations   or 
variances,  278n. 
use  in   probability-wise  interpolations  in   per- 
centage-point tables,  316n. 
use  in  variables  inspection  for  fraction  defec- 
tive, 17-21,  60-63,  81,  82 
(See  also  K€) 
K(j:  normal  deviate  exceeded  with  probability  /3 
(consumer's  risk,  or  probability  of  an  error 
of  second  kind),  17,  278n.,  380 
formula  for,  263,  264 

use  to  determine  number  of  observations  needed 
when  decisions  will  be  based  on  signs  of 
differences  between  sample  means,  380 
(See  also  Ka,  uses  for;  Ke) 
Ke:  normal  deviate  exceeded  with  probability  e, 
17,  380 
formula  for,  17,  18,  31,  260 
relation  to  "probit,"  343 
tables  of,  references  to,  10n.,  18,  343n. 

need  for  care  in  using,  lOn. 
use   in    asymptotic    expansion    for    percentage 

points  of  i-distribution,  148 
uses,  other  (see  Ka  and  Kfi;  Ki-y;  Ko;  Ki;  K2) 
Ki-y:  normal  deviate  exceeded  with  probability 

1-7,  336 
Ko:    normal    deviate    exceeded    with    probability 

po,  21 
Kii    normal   deviate    exceeded    with    probability 

pi,  17 
Ki\    normal   deviate    exceeded    with    probability 

Vi,  17 
k:  coefficient  of  s  in  variables  inspection  plan  for 
fraction  defective,  12-14,  16-26 
adjustment  of,  when  N  is  fixed,  19,  20 
approximate  evaluation  by  formula,  17-20 

comparison  with  exact  values,  64-66 
exact  evaluation,  from  indifference  quality,  21 

using  non-central  ^-distribution  71-82 
formulas  for,  17-20 

derivation  of,  59,  60,  62-63 
table  of  values,  22-25,  65 
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k:  coefficient  of  s  in  variables  inspection  plan  for 
fraction  defective,  uses  of,  additional,  27n., 
2Sn. 
k:  number  of  items  the  range   (Rk)   of  which  is 

critical,  227,  228 
k:  number  of  samples  to  which  a  given  acceptance 
number    (Ak)    or    rejection    number    (Rk) 
corresponds,  237 
(See    also    Acceptance     number;     Rejection 
number) 
/fcx:  acceptance  level  for  ko  in  sequential  variables 

inspection  for  fraction  defective,  84,  85 
ko:  observed  difference  between  upper  limit  (U) 
and  sample  mean  (x)  in  units  of  s;  sample 
estimate  of  K,  43 
use    in    sequential    variables    inspection    for 
fraction  defective,  84,  85 
kti:  rejection  level  for  ko  in  sequential  variables 

inspection  for  fraction  defective,  84,  85 
k':    y/n    times    coefficient    of    s    in    acceptance 
criterion,    used    when   items    are   eventually 
assembled  in  sets  of  n,  48 
Kelley,  Truman  Lee,  18,  93 

Kendall,  Maurice  G.,  120n.,  184,  196n.,  199n.,  222 
Kohn,  Anne  Frankenthaler,  x 
Kurtc»6is,  measures  of  (02, 72),  197,  198 

of  distribution  of  error  of  an  interpolated  value, 
221 
of  sample  mean  when  data  are  grouped  or 
rounded,  197,  198 


L:  upper  limit  of  a  measured  quality  character- 
istic, above  which  an  item  is  defective,  72 
(See  also  U) 
LP:  probability  that  a  lot  of  fraction  defective  p 
will    be    accepted    by    a    given    acceptance 
inspection  plan  if  submitted,  15,  16,  28 
evaluation  of,  28-33,  50-52,  76,  77,  80,  358,  361, 

427 
use  of,  in  determining  confidence  limits  for  p, 
43-45,  336 
L<£:  probability  of  concluding  that  <T22  <  <n2  when 
o-22  =  <£<n2,  282 
curves   showing,   for  F-tests   of   variance   esti- 
mates, 296,  297,  318n. 
evaluation  of,  for  F-tests  of  variance  estimates, 
282-296,  316,  317 
Lp\  probability  of  concluding  that  <n2  <  <ro2  when 
<n2  =  po-o2,  278 
curves  showing,  for  x2-test  of  variance  estimate, 

274,  275,  318n. 
evaluation  of,  for  sequential  sampling  (see  La) 
for    single-sample    x2-test,    exact    procedure, 
271-278,  279,  315 
large-sample  formula  for  (see  Lff) 
La'  probability  of  concluding  that  0-2  <  a\  when 
er2  =  a,  232 
curve  showing,  for  sequential  test,  234 
evaluation  of,  for  sequential  sampling,  233 
for  single-sample  x2-test,  exact  procedure,  232 
large-sample  formula  for,  232 


Lt:  probability  of  concluding  that  Pe  >  Ps  when 
true   angular  difference  between  Pe  and  Pa 
is  T,  262 
curve  showing,  262 
evaluation  of,  by  formula,  262 
from  nomogram,  263 
L0:  probability  of  concluding  from  likelihood  ratio 
sequential  test  that  6  <  do,  as  a  function  of 
the  true  value  of  0,  427,  428 
Lfi,ff:  probability  that  a  lot  of  mean  n  and  standard 
deviation  a  will  be  accepted  by  a  given  ac- 
ceptance inspection  plan  if  submitted,  16,  52 
Lagrangian   interpolation    coefficients,    108,    109, 

219,  222 
X:  natural  logarithm  of  ratio  to  two  probabilities 
or  probability  densities,  computed  after  each 
observation  in  sequential  analysis,  83 
X  (f,to,e) :  function  of  /,  to,  and  e  used  in  calculation 
of  8  (f,to,e)  and  t  (f,8,e)  from  tables  of  non- 
central  ^-distribution,  74,  75 
Xi:   value   of  X   (f,to,e)    computed  in  determining 

Pi-a,  82 
X2:  value  of  X  (f,to,e)  computed  in  determining  pp, 

82 
Landau,  E.,  408n. 
Langdon,  W.  H.,  195n.,  196n.,  222 
Large  samples,  formulas  and  procedures  valid  for, 
17-21,  30-33,  43,  58-63,  81,  82,  101,  108,  109, 
148,  149n.,  155,  162-163,  169,  231,  232,  263, 
264,  278n.,  336,  377-381,  393,  399-401,  407, 
408-413 
accuracy  of,  64,  65,  401-407 
when  data  are  grouped  or  rounded,  196-203 
Least  squares,  application  of,  when  cost  of  each 
measurement   depends   upon  its   magnitude, 
319-328 
vdien  data  are  observations  on  cumulative  dis- 
tribution, 343,  344 
references  to,  343n.,  352 
Lethality  of  drugs,  341 
Level  of  significance,  270,  271,  281 
Levine,  Harriet,  x,  6,  109,  149n.,  184 
Levine,  Myra,  x,  xi,  236 

Likelihood  ratio,  as  basis  for  sequential  tests,  83, 
419-421 
definition  of,  420 

formula  for  logarithm  of  (X),  in  sequential  bi- 
nomial test,  421 
in  sequential  double-dichotomy  test,  424 
in  sequential  mean  test,  a  known,  423 

a  unknown,  84 
in  sequential  variables  inspection  for  fraction 
defective,  84 
Limits,  confidence  (see  Confidence  limits) 
control  (see  Control  limits) 
of  normal  variation  (see  Tolerance  limits) 
tolerance  (see  Tolerance  limits) 
Lindley,  D.  V.,  281n. 

Linear  regression,  when  cost  of  each  measurement 
depends  upon  its  magnitude,  319-328 
when  data  are  observations  on  cumulative  dis- 
tribution, 343,  344 
references  to,  343n.,  352 
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Location,  estimation  of  parameters  of  (see  Esti- 
mation; Mean;  Median) 
Logarithmic  scale,  321 
Logarithmic  transformation,  412-413 

(See  also  ^-transformation) 
Logarithms,  of  factorials,  253 
reference  to  tables  of,  253n. 
Lot,    acceptability    of,    determined    by    average 
quality,  8,  27,  28,  61-63 
by  dispersion  of  quality,  8,  27n.,  28n.,  229, 

269,  270,  278-280,  283 
by  fraction  defective,  7,  8 
by  fraction  of  averages  of  samples  of  n  that 

exceed  specified  limit,  48 
by  fraction  of  ranges  Rk  in  samples  of  k  that 
exceed  specified  limit,  228,  229 
formulation  of,  in  terms  of  a,  229-231 
by  mean  quality  (/*),  27,  28,  61-63 
by  mean   (n)  and  standard  deviation   (a)   of 

quality,  11 
by  several  quality  characteristics,  50-52,  113, 

114 
by  standard   deviation   of   quality,   8,   27n., 

28n.t  229,  269,  270,  278-280,  283 
by  upper  and  lower  limits  on  a  single  quality 

characteristic,  52-56,  63,  64 
(See    also    Acceptable    quality;    Acceptable- 
quality  level) 
acceptance  of,  meaning  of,  13-14 

procedure  for  (see  Acceptance  inspection) 
judgment   of,   from  a  sample   or   samples    (see 
Acceptance   criterion;   Acceptance   inspec- 
tion) 
quality  of,  choice  of  standards  for,  46-49 

specification  of  (see  Lot,  acceptability  of) 
rejection  of,  meaning  of,  13,  14 
submitted,     probability     of     acceptance     (see 

Operating-characteristic  curve) 
variation  of,  from  item  to  item,  8 
Lot  tolerance  percent  defective  (LTPD;  ps),  defi- 
nition of,  45,  229 
difference  between  true  and  intended,  in  varia- 
bles inspection  for  fraction  defective,  21-26 
causes  of,  21,  22 
discussion  of,  21,  22,  25,  26 
table  showing,  22-25 
selection  of,  considerations  in,  45-49 
I.owan,  Arnold  N.,  x 

(See  also  Mathematical  Tables  Project) 
LTPD  (see  Lot  tolerance  percent  defective) 


M 


M :  minimum  value  of  a  measured  quality  charac- 
teristic, below  which  an  item  is  defective,  13 

m:  two-point  estimate  of  y.  based  on  probit  data, 
350 
formula  for,  350 
variance  of,  351 

with  optimum  arrangement,  352 

Marginal  totals,  held  fixed  in  analysis  of  2  X  2 
table,  251 

Martin,  Celia  S.,  x 

Massachusetts  Institute  of  Technology,  ix 


Mathematical  Tables  Project,  x,  10n.,  31n.,  83n., 
93,  108,  109,  209n.,  219n.,  222,  255n.,  260n., 
265 
Mather,  K.,  414n. 

Matrices,  numerical  inversion  of,  172-173 
Maximum  of  a  function,  365 

design  of  experiments  for  determining.  363-372 
factorial,  366,  367,  369-372 
when  form  of  function  is  known.  371 
sequential,  367-372 
single  sample,  reference  to,  365n.,  372 
use  of  preliminary  sample  in,  371 
Mean,  arithmetic  (see  Arithmetic  mean;  Sample 
mean) 
of  binomial  distribution,  397,  410 
of  grouped  distribution,  193,  194 
of  lot  (ju),  as  measure  of  lot  quality,  8,  27,  28, 

61-63 
of  normal  distribution,  9,  12,  61,  97,   161,  197, 
198,  ,342,  378,  423 
variance    of   maximum   likelihood    estimator 
of,  when  data  are  grouped,  197 
of  Poisson  distribution,,  411 

of    population    (/j),    bias    of    sample    mean    as 
estimator   of,   when  data  are  grouped   or 
rounded,  187,  188,  212,  213 
confidence  limits  for,  a  known,  197,  198 

a  unknown,  99,  200-203,  213-215 
decisions  regarding,  based  on  sign  of  differ- 
ence between  sample  means,  377-382 
estimation  of,  by  sample  mean,  162,  187,  188, 
192,  196-198,  203,  212,  213 
from  observations  on  cumulative  distribu- 
tion, 342-352 
optimum    design    of    experiments    for, 
345,  346,  349-356 
importance    of    distinguishing    from    sample 

mean  (x),  12 
as  parameter,  12,  161 

tests  of  significance  regarding,  sequential,  a 
known,  423 
<r  unknown,  83-85,  426 
single  sample,  a  known,  61,  62 

when  data   are  grouped   or   rounded, 
197,  198,  212,  213 
a  unknown,  61-63,  71-73 

when   data   are   grouped   or  rounded, 
200-203 
variance  of  maximum  likelihood  estimator  of, 
when  data  are  grouped  or  rounded,  197 
of  r,  413 
of  s,  58 
of  S2,  192,  412 

when  data  are  grouped  or  rounded,  199,  212, 

213 

of  sample,  as  estimator  of  population  mean,  12, 

162,  187,  188,  192,  196-198,  203,  212,  213 

bias  of,  when  data  are  grouped  or  rounded, 

187,  188,  203,  212,  213 
(See  also  Expected  value) 
Mean     deviation      (mean     absolute     deviation), 
definition  of,  162 
as  estimator  of  <r,  162,  163,  169 
Mean  error  (see  Mean  deviation) 
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Mean  squares,  in  analysis  of  variance,  312 
Measurement,  errors  of,  188,  189 
Median,  of  sample,  9 

as  estimator  of  population  median,  162 
Merrington,  Maxine,  148n.,  151ri.,  152«.,  184,316, 

317,  318,  388rc.,  393,  394 
Metallurgical  experiments,  321,  365 

design    of,    when    cost    of    each    measurement 
depends  upon  its  magnitude,  322-328 
when    object    is    to    determine    composition 
yielding  maximum  performance,  363-372 
Molina,  E.  C,  109 
Moments,  about  origin,  193 

effect  of  grouping  or  rounding  on,  193,  194 
about  mean,  195 

effect  of  grouping  or  rounding  on,  195,  196 
average,    when   grouping  lattice   is   located   at 

random,  194,  195 
centroidal,  195 

effect  of  grouping  or  rounding  on,  195,  196 
of  distribution  of  s2,  when  data  are  grouped  or 
rounded,  199 
of  sample  mean,  when  data  are  grouped  or 

rounded,  198 
of  t,  when  data  are  grouped  or  rounded,  201, 
202 
of  grouped  distributions,  193-196 

average,  when  grouping  lattice  is  located  at 

random,  194,  195 
effect  of  location  of  rounding  lattice  on,  194, 
195,  196,  206-213 
ideal,  193 
raw,  193 
realistic,  193 
Mosteller,  Frederick,  238n.,  244,  414n. 
n:  mean  of  lot  or  population  (see  Mean,  of  lot;  of 

population) 
Multiple  of  s,  for  setting  tolerance  limits  (see  K) 
in   variables   inspection   for   fraction   defective 
(see  k) 
Multiple  of  a,  determined  by  AQL  and  U,  11 
determined  by  difference  between  U  and  n,  9-13 
determined  by  proportion  of  population  to  be 
included  between  tolerance  limits,  97 
Multiple  sampling,  attributes  inspection  for  frac- 
tion defective,  237-244 
average    amount    of    inspection    (ASN),    com- 
pared to  double  and  sequential  sampling, 
238-244 
computational  methods,  references  to,  244 
figures  showing,  240,  242 
characterized  by  relation  of  Rk  to  Ak,  237 
variability  of  amount  of  inspection  <r(N)  com- 
pared    to     double     sequential     sampling, 
238-244 
computational  methods,  references  to,  244 
figures  showing,  241,  243 
Multivariate  quality  control,  applied  to  air  testing 
of  bombsights,  113-184 
analysis  of  certain  test  bombings,  168-183 
arrangement  of  bombings,  121,  122 
control  charts,  122-142 

actions  to  be  taken  when  "out  of  control," 
check  list  of,  140 


Multivariate     quality    control,     control     charts, 
figures  showing,   124,   126,   128,    130,   132, 
134,  136 
formulas   and   procedures    for    constructing, 

147-156 
illustrations  of  use  of,  135-142 
revision  of,  considerations  affecting  and  pro- 
visions for,  142-147 
simultaneous    use    of    several    sample    criteria, 

50-52,  113,  114 
use  of  single  "score"  or  "index,"  51,  113,  114 
based  on  generalized  Student's  T,  114-116 
Muzzle  velocity,  tolerance  limits  for,  100,  101 


N 


N:    sample    size    needed,    in    analysis-of-variance 
tests,  311-314 
in  attributes  inspection,  259,  260n.,  263,  264 
when  severity  of  any  one  test  cannot  be  con- 
trolled precisely,  331-338 
for  binomial  test,  one-sided,  259,  260n.,  263,  264 
for  x2  test  of  variance  estimate,  231,  274,  275, 

278 
for  determining  whether  level  of  performance  of 
a  process  or  product  exceeds  standard,  17, 
28n.,  59-63,  81 
for    determining    whether    variability    of    per- 
formance of  a  process  or  product  exceeds 
standard,  231,  274,  275,  278 
is  less  than  standard,  278-280 
for  fraction  defective  tests,  based  on  attributes, 
259,  260n.,  263,  264 
based  on  variables,  17 

when  m  is   known   and  a  unknown,   27?!., 

28n.,  231,  274,  275,  278n. 
when   ix  is   unknown   and   <r   known,    27n., 

28n.,  61,  62 
when  fi  and  a  are  both  unknown,   17—21, 
27n.,  28n.,  59-66,  81,  82 
in  general  one-parameter  tests,  61,  62 
for  mean  test,  a  known,  27n.,  28n.,  61,  62 
a  unknown,  17,  27n.,  28n.,  59-63,  81,  82 
for  proportion  test,  259,  260n.,  263,  264 
when    range   in    samples    of    k   determines   lot 

quality,  231 
for    standard    deviation    tests,    231,    274,    275, 

278-280 
in  variables  inspection   for  fraction   defective, 
17-21,  27n.-28n.,  59-66,  81,  82,  231,  274, 
275,  278n. 
N:  size  of  each  of  two  samples  needed,  for  com- 
parison of  two  means,  a  known,  61—62,  380 
a  unknown,  17,  27n.,  28n.,  59-63,  81,  82 
for  comparison  of  two  proportions  or  percent- 
ages, 259,  260,  263,  264 
nomogram  showing,  260 
for  comparison  of  two  standard  deviations  or 

variances,  280,  282,  283,  296 
when  decisions  are  based  on  signs  of  differences 

between  sample  means,  380 
for  determining  whether  level  of  performance  of 
one    process    or    product    exceeds    that    of 
another,  17,  27n..  28n.,  59-63,  81,  82,  380 
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N:  size  of  each  of  two  samples  needed,  for  deter- 
mining whether  variability  of  performance  of 
one    process    or    product    exceeds    that     of 
another,  280,  282-283,  296 
for  difference  of  means  test,  a  known,  61,  62, 
380 
<r  unknown,  17,  27n.,  28n.,  59-63,  81,  82 
for    difference    of    proportions    or    percentages 
test,  259,  260,  263,  264 
nomogram  showing,  260 
for  double-dichotomy  test,  259,  260,  263,  264 

nomogram  showing,  260 
for  F- tests  of  variance  estimates,  280,  282,  283, 
296 
"N:  average  number  of  observations  needed,  233, 
238,  420,  427 
(See  also  ASN  curve) 
Na:  sample  size  needed  with  single-sampling  attri- 
butes inspection  plan,  26,  27,  259,  260n.,  263, 
264 
Nn  number  of  observations  in  i-th.  sample  of  a 

group  of  k  samples,  43 
Nm:  size  of  sample  needed  with  variables  inspec- 
tion  plan   for   fraction   defective   when  <r   is 
known,  27,  28n.,  61,  62 
Ns:  size  of  sample  needed  with  variables  inspec- 
tion  plan   for   fraction   defective   when   n  is 
known,  27n.,  28n.,  231,  274,  275,  278 
Nt:  size  of  sample  needed  with  variables  inspec- 
tion plan  for  fraction  defective  when  n  and  a 
are  both  unknown,  17-21,  26-28,  27n.,  28ra., 
59-63,  81,  82 
N':  number  of  sets  of  n  items  each  in  a  sample,  48 
n :  degrees  of  freedom,  in  analysis  of  variance,  313 
of  x2,  231,  270,  316,  387-389 
of  s  or  s",  150,  152,  200,  203,  231-232,  387,  388- 

389 
of  T,  145,  155 
of  t,  148,  150 
n:  number  of  items  constituting  a  set  which  is 
judged  defective  or  nondefective  on  the  basis 
of  the  arithmetic  mean  of  the  set,  48 
n:  number  of  variables  upon  which  the  quality  of 

an  item  depends,  50 
n:  size  of  preliminary  sample  in  attributes  inspec- 
tion for  fraction  defective  when  severity  of 
any  one  test  cannot  be  controlled  precisely, 
331,  336-338 
N  -  1  versus  N,  13,  58 
Nair,  U.  S.,  386n.,  393 

National  Bureau  of  Economic  Research,  iii,  ix 
National  Bureau  of  Standards,  iii,  ix-xii,  93,  109, 
186,  222,  265,  268,  376,  384,  396 
(See  also  Mathematical  Tables  Project) 
National  Defense  Research  Committee,  vii 

Applied  Mathematics  Panel,  vii,  x 
Navy,  viii 

New  method,  process,  or  product,  compared  with 
old,  with  respect  to  fraction  defective,  249- 
264,  424,  425 
with  respect  to  level  of  performance,  59—63, 

380 
with  respect  to  variability  of  performance, 
280,  282,  283,  296 


New  method,  process,  or  product,  compared  with 

standard,  with  respect  to  fraction  defective, 

7-93,  27n.,    28n.,   231-234,   237-244,   270. 

271,  274,  275,  278-280,  423 

with  respect  to  level  of  performance,  17,  27n., 

28n.,  59-63,  81,  82,  423 
with  respect  to  variability  of  performance, 
227-234,  270,  271,  274,  275,  278-280 
New      York       Mathematical       Tables       Project 
(NYMTP)  (see  Mathematical  Tables  Project) 
Newton's  method,  109 
Neyman,  Jerzy,  271n.,  318 
Nomogram,  for  analyzing  binomial  data,  260 

Zubin's,  references  to,  398,  416 
Nomograph  (see  Nomogram) 
Non-central  t,  8n.,  22,  64-66,  71-82,  83n.,  85n. 
definition  of,  71-73 
distribution  of,  72,  73 

Johnson  and  Welch  tables  of,  description  of, 
73,  74 
notation  used  in,  72,  73 
procedures  for  using,  74-76 
relation  to  Student's  t,  72 

use  in  sequential  variables  inspection  for  frac- 
tion defective,  83n.,  85n. 
use   in   single-sample   variables   inspection   for 
fraction  defective,  64-66,  71-82 
for  evaluating  k,  procedure  for,  76,  81,  82 
worked  example,  77-79 
work  sheets,  77-79 
far  evaluating  N,  procedure,  81,  82 
for  evaluating  points  on  OC  curve,  procedure, 
76,  77 
worked  example,  80 
work  sheet,  80 
Nondefective,  7 

Non-normal  distributions,   50,   51,  99,   377,   378, 
381,  382 
best  discriminant  functions  for  discriminating 

between,  381,  382 
tolerance  limits  for,  99,  109 

(See  also  Normality  assumption) 
Normal  deviates,  exceeded  in  absolute  value  with 
given  probability,  148 
relation  to  percentage  points  of  x?-distribu- 

tion,  258n. 
tables  of,  references  to,  10n.,  18n.,  148rc.,  149n. 
exceeded  with  given  probability   (see  Ka,  K@, 
Ki-y,  K0,  Kx,  and  K2) 
relation  to  probits,  343 
short  table  of,  18 

tables  of,  references  to,  10n.,  18,  343n. 
need  for  care  in  using,  10n. 
probabilities     corresponding    to     given,     short 
table  of,  31 
tables  of,  references  to,  10n.,  31 
need  for  care  in  using,  lOn. 
Normal  distribution,  8,  9-13,  17,   18,  21,   31,  49, 
50,  58,  61,  62,  64,  73,  83,  97,  98,  100-109, 
161-163,  197-214,  221,  229,  255,  263,  264, 
312,  336,  341-344,  350,  377-381,  412,  413, 
423,  426 
(See  also  Normal  deviates ;  Normality  assump- 
tion; Normally-distributed  test  function) 
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Normal  distribution,  basis  for  "probits,"  343 
bias  of  mean  deviation  as  estimator  of  standard 

deviation  of,  169tz. 
bias  of  s  as  estimator  of  standard  deviation  of, 
58,  162 
adjustment  factor,  formida  for,  58 
tables  of,  references  to,  58n. 
bias  of  s2  as  estimator  of  variance  of,  when  data 

are  grouped  or  rounded,  199 
in  biological  assay,  342 
bivariate,  146 

correlation  coefficient  of,  413 
relation  of  T  to  exponent  of,  114-115 
characteristics  of,  9-13,  97,  98,   161-163,   198, 
199,  200,  202,  221,  229,  312,  378,  379,  381 
cumulative,  estimating  y.  and  a  from  data  on, 
339-352 
optimum  design  of  experiments  for,  345,  346, 
349-352 
departures  from,   seriousness   of,   in  statistical 
analyses,   49,   50,   99,    198,    199,   202,  346, 
377,  381 
dependence     of     proportion     exceeding     given 
limit  on  the  parameters  of,  10 
(See  also  Normal  deviates) 
in  dosage-mortality  studies,  342 
of   errors   of   tabular   values,    advantages   that 

would  accrue  from,  221 
estimation  of  the  parameters  of,   12,   58,   161- 
163,  169 
from   grouped    or   rounded    data,    187,    188, 

196-203,  212-214 
from    observations    on   cumulative   distribu- 
tion, 339-352 
expected    value,    of    mean    deviation    in    large 
samples  from,  169 
of  s  in  samples  from,  formula  for,  58 
of  x  +  ks  in  samples  from,  58 
figure,  schematic,  10 
formula  for,  161,  423 

for  partial  integral  of,  17,  18,  31,  260,  263 
generalized,  197 

(See  also  Gram-Charlier  series) 
grouped,    distribution   of   s2  in   samples   from, 
198-200,  203-214 
of  t  in  samples  from,  200-203,  214,  215 
of  x  in  samples  from,  196-198,  203,  212,  213 
independence  of  x  and  s2  in  samples  from,  200 

effect  of  grouping  on,  187,  188,  201 
kurtosis,  measured  by  /3>  or  yz,  197,  198 

of  distribution  of  means  from,  when  data  are 
grouped  or  rounded,  197,  198 
mean  of  (M),  9-12,  61,  97,  161,  162,  197,  198, 
342,  378,  423 
estimation  of,  from  observations  on  cumula- 
tive distribution,  342-352 
from  probit  data,  342-352 
by  sample  mean,  162,  187,  188,  192,  196- 

198,  203,  212,  213 
by  sample  median,  162 
tests  of  significance  regarding,  sequential,  a 
known,  423 
a  unknown,  83-85,  426 


Normal  distribution,  mean  of  (n),  test  of  signifi- 
cance regarding,  single-sample,  a  known, 
61,  62,  197,  198,  212,  213 
0-  unknown,  61-63,  71-73,  200-203 
variance  of  maximum  likehood  estimator  of, 
when  data  are  grouped  or  rounded,  197 
parameters  of,  estimation  of  (see  Normal  distri- 
bution, estimation  of  parameters  of;  mean 
of;  standard  deviation  of;  variance  of) 
notation   for,    emphasizing   distinction   from 

sample  estimates,  12 
tests   of    (see   Normal   distribution,    tests   of 
significance  regarding  parameters  of;  mean 
of;  standard  deviation  of;  variance  of) 
range  in  samples  of  k,  cumulative  distribution, 
figure  showing,  230 
tables  of,  references  to,  229,  230,  234 
distribution  of,  234 

depends  only  on  k  and  <r,  229 
as  estimator  of  a,  227 
mean  values  of,  227n.,  234 
standard  deviation  of,  227n.,  234 
standard  deviation  of   (a),  9-12,  97,  161,  229, 
342,  426 
estimation  of,  from  mean  deviation,  162,  163, 
169 
from  observations  on  cumulative  distribu- 
tion, 342-352 
from  probit  data,  342-352 
from  range,  227 
from  s,  12,  58 

(See  also  Normal  distribution,  variance 
of) 
tests    of    significance    regarding,    sequential, 
232,  233,  426 
single-sample,  200,  231,  270,  271,  274,  275, 
278-280 
standard  deviation  of  s  in  samples  from,  58 

of  x  +  ks  in  samples  from,  58 
tables  of,  18,  31 

need  for  care  in  using,  lOn. 
references  to,  10n.,  18,  31,  148n.,  149n.,  343n. 
types  of,  lOn. 
tests   of  significance  regarding  parameters  of, 
sequential,  83-85,  232,  233,  423,  426 
single-sample,  61-63,  71-73,   197,  198,  200- 
203,  212,  213,  231,  270,  271,  274,  275, 
278-280 
(See  also    Normal   distribution,    mean   of; 
standard  deviation  of;  variance  of) 
variance  of,  161,  198,  199,  200,  212-214 
definition  of,  161 

estimation  of,  by  mean  deviation,   162,  163, 
169 
by  s2,  162,  198-200,  203-214 
tests  of  significance  regarding,  sequential, 
232,  233,  426 
single  sample,  200,  231,  270,  271,  274, 
275,  278-280,  282,  283,  296 
(See  also  Normal  distribution,  standard  devi- 
ation of) 
Normal  inspection,  43-45 

acceptable-quality  level  for,  choice  of,  45-49 
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Normal  law  (see  Normal  distribution) 
Normal-probability  function,  tables  of,  10n.,  18, 
31,  148n.-149n.,  343n. 
need  for  care  in  using,  lOn. 
types  of,  10n. 
Normal-probability  paper,  414 
Normal  surface  (see  Normal  distribution,  bivari- 

ate) 
Normality  assumption,  avoided  by  use  of  attri- 
butes inspection,  8,  229n. 
by  use  of  distribution-free  tolerance  limits, 
99,  109 
dependence  upon,  of  analysis  of  probit  data,  346 
of  moments  of  ^-distribution,  201 
of  OC  curve  for  variables  inspection  plan  for 

fraction  defective,  49,  50 
of  tolerance-limit  factor  K,  99 
of  xi  —  X2  as  best  decision  function  regarding 
sign  of  fii  —  H2,  381 
importance  of,  in  air  testing  of  bombsights,  146 
in  analysis  of  sensitivity  data,  346 
in  evaluating  probability  that  sample  means 
will   be    in    opposite    order  to   population 
means,  377,  378 
in  setting  tolerance  limits,  99 
in  variables  inspection  for  fraction  defective, 
49,  50 
use  of,  when  severity  of  any  one  test  cannot  be 
controlled  precisely,  336 
Normally  distributed  test  function,  derivation  of 
critical  level  (A)  and  number  of  observations 
needed  (N)  for,  61,  62 
Notation,   as   means   of   emphasizing   distinction 
between     parameters     of     population     and 
sample  estimates  of  these,  12 
used  in  variables  inspection  for  fraction  defec- 
tive, 85-96 
Nuisance  parameters,  115 
Null  hypothesis,  420 

in  analysis  of  variance,  311,  312 

in  mean  tests,  386,  423 

in  proportion  tests,  249,  251,  264,  357,  358,  421, 

424 
in  standard  deviation   or  variance   tests,   270, 
278,  281,  386,  387,  311,  312 


OC  (see  Operating  Characteristics) 
OC  curves  (see  Operating-characteristic  curves) 
OC  surface  (see  Operating-characteristic  surface) 
Office   of    Scientific   Research   and   Development 
(OSRD),  vii 
(See  also  National   Defense  Research  Com- 
mittee) 
Omnibus  tests,  386 

Old  method,  process,  or  product  compared  with 
new,    with   respect   to   fraction   defective, 
249-264,  424,  425 
with  respect  to  level  of  performance,  59-63, 

380 
with  respect  to  variability  of  performance, 

280,  282,  283,  296 
(See  also  Standard) 


One-decision  sequential  binomial  test,  358 
One-sided  criteria,  use  of  a  pair  of,  in  variables 
inspection  for  fraction  defective,  53,  54 
compared  to  use  of  a  single  two-sided  criterion, 

52,  56 
figure  showing  acceptance  and  rejection  regions 
in  (x,  s)-plane,  55 
One-sided    or    one-tail    tests,    of    statistical    hy- 
potheses, 52-57,  150,  152,   156,  426 
Operating    characteristics    (OC),    15,    16,    28-32, 
261-263,  271,  278 
definition  of,  14,  261,  271 
Operating-characteristic  curves  (OC  curves),   15, 
29,  234,  263,  274,  275,  296,  297,  379 
for  analysis-of-variance  tests,  313,  314 
for  attributes  inspection,  sequential  sampling, 
426,  427 
single  sample,  260n.,  262,  263 
when  averages  in  samples  of  n  determine  lot 

quality,  48 
for  binomial  test,  one-decision  (RP),  sequential, 
358 
one-sided,  sequential,  426,  427 

single  sample,  260n.,  262,  263 
truncated,  sequential,  361 
for   x2-test   of   variance  estimate   (Lp,Lff),  231, 
232,  271-279,  315,  318n. 
figures  showing,  274,  275,  315n. 
large-sample  formula  for,  232 
tables  for  constructing,  272,  273,  276,  277 
when  decisions  are  based  on  signs  of  differences 
between  sample  means  [P(8,N)]t  378-380 
figure  showing,  379 
table  for  constructing,  378 
definition,  15,  16,  28,  41,  42,  232,  278,  358,  427 
detailed,  for  single-sample  variables  inspection 
plan   for   fraction   defective,    determining, 
30-33 
determines   what   fraction   of   production   sub- 
mitted will  be  accepted,  47 
as  determining  factor  in  cost  of  quality,  46 
for  difference  of  means  test,  a  known,  61,  62 
a  unknown,  30,  59-61,  62,  63,  71-73,  76,  77, 
80 
for  difference   of   proportions  test,   sequential, 
424,  426,  427 
single  sample  (Lr),  262,  263 
figure  showing,  262 
for  double-dichotomy  test,  sequential,  424,  426, 
427 
single-sample  (Lt),  262,  263 
figure  showing,  262 
for  double-sampling  plans,  reference  to,  238 
effects  of  changes  in  acceptance  criterion  on, 
21,  45 
of  changes  in  sample  size,  21,  45 
of  reduced  inspection,  45 
of  simultaneous  use  of  several  acceptance- 
rejection  criteria,  50-52,  113,  114 
of  tightened  inspection,  45 
for     experiments     for     determining     whether 
fraction  defective  of  one  process  or  product 
exceeds     another,     sequential,     424,     426, 
427 
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Operating-characteristic  curves  (OC  curves),  for 
experiments  for  determining  whether 
fraction    defective  of   one   process   or 
product  exceeds  another,  single-sample 
(Lt),  262,  263 
figure  showing,  262 
whether  level  of  performance  of  one  process 
or    product    exceeds    another,    30,    59-63, 
71-73,  76,  77,  80 
whether   variability   of   performance   of   one 
process  or  product  exceeds  another  (L<p), 
281-283,  296,  316,  317,  318n. 
figures  showing,  296,  297,  318rc. 
tables  for  constructing,  284-295,  298-309 
whether   variability   of   performance   of   one 
process    or    product    exceeds    standard 
(Lp,Lff),  sequential,  233 
single    sample,    231,    232,    271-278,    315, 
318n. 
figures  showing,  234,  274,  275,  315n. 
large-sample  formula  for,  232 
tables  for   constructing,   272,   273,   276, 
277 
whether   variability   of   performance   of   one 
process  or  product  is  less  than  standard, 
279 
f  or  F-tests  of  variance  estimates  (L^),  281,  283, 

296,  313,  314,  316,  317,  318n. 
figures  showing,  296,  297,  318n. 

tables  for  constructing,  284-296,  298-309 
figures  showing,  15,  29,  234,  263,  274,  275,  296, 

297,  379 

references  to,  238n.,  S18n. 
five-point,  28-30,  427 

figure  showing,  29 
for  fraction  defective  tests  (see  Operating-char- 
acteristics   curves,    attributes    inspection; 
variables  inspection) 
for    general    one-parameter    tests,    sequential 
{LB),  426-427 
single  sample,  61,  62 
information  provided  by,  15,  16,  28,  41,  42,  278, 

358,  427 
interpretation  of,  15,  16,  28,  41,  42,  232,  278, 

358,  427 
location  of,  21,  45 
for  mean  test,  single  sample,  a  known,  61,  62 

a  unknown,  30,  59-63,  71-73,  76,  77,  80 
of  multiple-sampling  plans,  references  to,  238 

steps  in  computation  of,  outlined,  361 
for  one-decision  sequential  binomial  test  (RP), 

formula  for,  358 
ordinate  of,  15,  16,  28,  41,  42,  232,  278,  358,  427 
for  proportion  test,  sequential  sampling,  426, 
427 
single  sample,  260n.,  262,  263 
when   range   in   samples    of    k   determines   lot 
quality  (Lp,Lc),  sequential  sampling,  233 
figure  showing,  234 
single  sample,  231,  232,  271-278,  279,  315, 
318n. 
figures  showing.  274,  275,  315n. 
for  sequential-sampling  plans,  233,  358,  361, 
426,  427 


Operating-characteristic  curves  (OC  curves),  for 
sequential-sampling   plans,  general  proce- 
dure for  constructing  one-parameter  tests 
(Le),  426,  427 
five-point,  427 

for  one-decision  tests  (RP),  358 
for  specific  tests  (see  the  test  concerned) 
for  truncated  tests,  outline  of  steps  for  com- 
puting, 361 
when  several  acceptance-rejection  criteria  are 

used  simultaneously,  50-52,  113,  114 
for  single-sample  inspection  plans  and  tests  (see 

the  test  concerned) 
as  specification  of  what  will  be  accepted,  46 
for  standard  deviation  tests  (L^,,Lp,L<t),  sequen- 
tial, 233 
figure  showing,  234 
single  sample,  231,  232,  271-278,  279,  281- 
283,  296,  315-317,  318n. 
figures  showing,  274,  275,  296,  297,  315n. 
large-sample  formula  for,  232 
tables  for  constructing,  272,  273,  276,  277, 
284-296,  298-309 
steepness  of,  21,  45        T 
typical,  figure  showing,  15 
when  upper  and  lower  limits  are  imposed  on  a 

single  quality  characteristic,  52 
use  of,  to  determine  what  fraction  of  production 

submitted  will  be  accepted,  47 
use  in  selecting  plan  of  action,  15 
for  variables  inspection  plans  for  fraction  defec- 
tive, 15,  16,  21,  28-33,  41,  42,  45-50,  52, 
65,  66,  71-73,  76,  77,  80 
when  averages  in  samples  of  n  determine  lot 

quality,  48 
computation  of,  from  large-sample  formula, 
30-33 
from  non-central  t  distribution,  71-73,  76, 
77,  80 
dependence  on  normality  assumption,  49,  50 
effect  of  changes  in  N  or  k  on,  21,  45 
figure  showing,  29 
large-sample  formula  for,  30 
plotted  as  function  of  K  =  (U  —  y)/a,  49 
Operating-characteristic  surface,  16,  52,  261,  262 
Order  of  sample  means,  number  of  observations 
needed  when  decisions  are  based  on,  380,  381 
optimum  decision  function,  381,  382 
probability  opposite  to  population  means,  377- 
380 
figure  showing,  379 
formula  for,  379 
table  giving,  378 
Ordered  magnitudes,  367,  368n. 
Ore,  Oystein,  195n.,  196n,,  222 
Origin,  shifting,  to  simplify  graphical  procedure 
for  variables  inspection,  36,  37,  41,  54,  69-71 


P:  parameter  of  binomial  distribution,  250,  251 
P:  proportion   of  items   with  given  attribute  in 
infinite  population,  255,  256,  397,  410 
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P:  proportion  of  population  or  lot  to  be  included 

between  tolerance  limits,  47n.,  98,  99 
P:  true  long-run  probability  of  "success"  in  any- 
single  trial,  250,  251 
(See  also  p) 
Pe:  value  of  P  for  experimental  or  new  method, 

process,  or  product,  259 
Ps:  value  of  P  for  standard  or  old  method,  process, 

or  product,  259 
Po:  probability  or  probability  density  correspond- 
ing to  observations  in  hand,  if  null  hypothe- 
sis is  true,  420 
as  denominator  of  likelihood  ratio,  420 
use    of    logarithm    of,    to    simplify    sequential 
analysis,  421 
Pi:  probability  or  probability  density  correspond- 
ing to   observations  in   hand,   if   alternative 
hypothesis  is  true,  420 
as  numerator  of  likelihood  ratio,  420 
use    of    logarithm    of,    to    simplify    sequential 
analysis,  421 
P(8,N):   probability  sample   means   based   on   N 
observations    will    be    in    opposite   order   to 
population   means    when   latter   differ   by    5 
standard  deviation  units,  378,  379 
figure  showing,  378 
table  of,  379 
PA:  process  average  percent  defective,  42 
p:  fraction  defective  in  lot,  15 
p:  observed  proportion,  255,  397,  410 
p:  parameter  of  binomial  distribution,  421 
p:  true  long-run  proportion  of  "successes"  with  a 

given  method,  421 
pi -a-  the  fraction  defective  for  which  the  prob- 
ability of  acceptance  is  1  —  a.  with  a  given 
sampling  plan,  82 
pp:  the  fraction  defective  for  which  the  probability 
of  acceptance  is  (5  with  a  given  sampling  plan, 
82 
Pe:  observed  proportion  of  successes  with  experi- 
mental method,  256 
ps:  observed  proportion  of  successes  with  stand- 
ard method,  256 
po:  indifference  quality,  20,  29 
po:  value  of  p  when  null  hypothesis  is  true,  421 
pi:  acceptable-quality  level,  16,  229,  357,  359 

(See  also  Acceptable-quality  level) 
pi:  value  of  p  for  first  of  two  methods,  processes  or 

products,  424 
pi:   value   of   p   when   hypothesis   alternative   to 

p  =  po  is  true,  421 
pi:  lot  tolerance  percent  defective,   16,  229,  357, 
359 
(See  also  Lot  tolerance  percent  defective) 
pt:  value  of  p  for  second  of  two  methods,  424 
p':   fraction   defective   for    which   probability    of 

acceptance  (LPr)  is  1  —  LP,  32 
Pairman,  Eleanor,  194n.,  222 

Parameter,   of  binomial  distribution   (P,p),   250, 
251,  421 
of  Poisson  distribution,  411,  426 
of  population,  distinguished  from  statistic  and 
sample  estimate,  12,  161 
Park,  R.  H.,  335n. 


Paulson,  Edward,  ix,  xi,  99,   109,  248,  393,  394, 

419n. 
Peach,  Paul,  48n.,  93 
Pearse,  G.  E.,  194n.,  223 

Pearson,  Egon  Sharpe,  166n.,  184,  229,  230,  234, 
271n.,  318,  335n.,  386n.,  389n.,  393,  394 
quotation  from  article  by,  on  "omnibus"  tests, 

386 
quotation  from  letter  of,  on  accuracy  of  con- 
fidence belt  charts,  335 
Pearson,  Karl,  194n.,  222,  223 
Pegram,  George  B.,  x 
Peiser,  A.  M.,  149n.,  184 
Percent  defective  (see  Fraction  defective) 
Percentage  points  of  statistical  test  functions  (see 

Significance  levels) 
Percentages  (see  Proportions) 
<p:  transformed  proportion,  398,  399 

(See  also  Inverse-sine  transformation) 
<Pb:      transformed      proportion      with     Bartlett's 
adjustment,  401 
(See  also  Bartlett's  adjustment;  Inverse-sine 
transformation) 
<j>:  the  ratio  of  two  variances  (<ri2/<T22),  281 
4>(a,P,n\,ri2),  formula  for,  282 

derivation  of,  281n.,  316,  317 
special  cases  of,  282 
tables  of,  284-295,  298-309 
w.  ratio  of  circumference  of  a  circle  to  its  diameter, 
189 
value  correct  to  20  decimal  places,  189n. 
value  expressed  as  common  fraction,  correct  to 
6  decimals,  189n. 
Pitman,  E.  J.  G.,  386n.,  394 

Plan  of  action,  of  an  acceptance  inspection  plan, 
definition  of,  13 
for  locating  maximum  of  a  function,  sequential 

procedure  for,  367-369 
suitability  of,  shown  by  OC  curve,  15 
for  variables  inspection  for  fraction  defective, 
outline  of,  14 
procedures  for,  16-21,  33-41 
Planning    experiments,    245-372    (see    Design    of 

experiments) 
Plans  for  experiments  (see  Experiment  designs) 
Poisson  distribution  or  series,  relation  of  partial 
sum  to  percentage  points  of  x2,  109 
mean,  variance,  third  and  fourth  moments  of, 

411 
sequential  procedure  for  testing  parameter  of, 

426 
tables  of,  reference  to,  109 

transformation  of,  to  stabilize  variance,  deriva- 
tion of,  411,  412 
(See  also  Square-root  transformation) 
Population,  or  universe,  12,  98,  227,  251,  341,  355, 
377 
(See  also  Lot) 
Population  mean  (m,£),  12,  161 

(See  also  Confidence  limits,  for  n;  Estimation, 
of  population  mean;  Mean,  of  population; 
Statistical  tests,  relating  to  population 
mean) 
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Population  proportion  (P,p),  255-256,  397,  410 
(See  also  Binomial  distribution,  parameter  of; 
Confidence    belts;     Confidence    intervals; 
Confidence  limits;  Estimation;  Proportion, 
population;  Statistical  tests) 
Population  standard  deviation  (a),  12,  161 

(See    also     Confidence    limits;     Estimation; 
Standard  deviation,  of  population;  Statis- 
tical tests) 
Population  variance  (a2),  12,  161 

(See     also    Confidence    limits;    Estimation; 
Statistical  tests;  Variance,  population) 
Power  function,  of  a  statistical  test  procedure,  427 
(See     also     Operating-characteristic     curves, 
Operating-characteristic  surface) 
Precision  (see  Standard  deviation;  Variance) 
Preliminary  sample,  use  of,  in  attributes  inspec- 
tion when  severity  of  any  one  test  cannot  be 

controlled  precisely,  336-338 
in  planning  regression  studies,  when  cost  of 
each     measurement     depends     upon     its 
magnitude,  321-328 
in  planning  sensitivity  tests,  345,  346,  349- 
352 
Preliminary  trial  of  an  examination,  evaluating, 

280 
Princeton  University,  ix 
Probabilities,  compounding,  167 
Probability,  absolute,  251 

of  acceptance,  15,  16,  28-33,  41,  42,  45,  50-52, 
232,  233,  261-263,  271,  278,  279,  282,  315- 
317,  358,  361,  427,  428 
(See  also  Operating  characteristics;  Operat- 
ing-characteristic       curves;        Operating- 
characteristic  surface) 
effect  upon,   of  changes  in   acceptance   cri- 
terion, 21,  45 
of  changes  in  sample  size,  21,  45 
of  reduced  inspection,  45 
of  simultaneous  use  of  several  acceptance 

criteria,  50-52,  113,  114 
of  tightened  inspection,  45 
evaluation  of  (see  LP)  L</>;  Lp;  L(t;Lt;P(8,N); 
Computation,  of  OC  curve) 
conditional,  251 

density  or  density  function,  83,  155,  161,  193- 
195,  217,  270,  281,  342,  392 
definition  of,  193n. 
distributions,   10,  61,  62,  72,  73,  99,   109,   114, 
154,  155,  193-195,  197-199,  202,  214,  215, 
229,  230,  251,  270,  281,  342,  392 
element  of,  155,  161 

(See  also  Probability,  density) 
function  of  (see  Probability,  distributions) 
that  gun  barrel  will  be  proofed,  332 
paper,  arithmetic  normal,  414 

binomial,  414n. 
that  proximity  fuze  will  operate,  342 
that  sample  means  will  be  in  opposite  order  to 
population  means,  378-380 
Probability-wise  interpolation,  316??.,  318 
Probits,  analysis  of  data  expressed  as,  342-344 
definition  of,  343 
estimation  of  n  and  9  from,  343,  350 


Probits,    experiments  leading   to  data   expressed 
as,  341,  342 
optimum  design  of,  345-346,  349-352 
relation  to  normal  deviates,  343 
tables  of,  references  to,  343r?.,  352 
variance  of,  formula  for,  350 
Process,    average    (see    Process    average    percent 
defective) 
new  (see  New  method) 
old  (see  Old  method) 
quality  level  of   (see  Process  average    percent 

defective) 
standard  (see  Standard  method) 
Process  average  percent  defective  (PA),  42-45 
computation  of  confidence  interval  for,    from 
attributes  data,  333-336 
from  variables  data,  42-45 
as  criterion  for  determining  whether  to  main- 
tain or  change  volume  of  inspection,  42 
example  of,  44 
estimation  of,  from  attributes  data,  333-336 
from  variables  data,  42-45 
Process     control,     contrasted     with     acceptance 
inspection,  156,  157 
importance  of  x  and  s  in,  34 
(See  also  Quality  control) 
Producer's  risk  (a),  conventional  value  of  (5%), 
45n. 
definition  of,  16,  228,  229 
maximum  of,   with  variables  inspection  plans 

for  fraction  defective,  16,  17 
selection  of,  factors  to  consider  in,  45-49 
Product,  new  (see  New  method) 

old  (see  Old  method) 
Product-moment  coefficient  of  correlation,  413 
Production  possibilities,  analysis  of,  46,  47 
Proof  testing,  statistical  aspects  of,  331-338 
Proofing.,gun  barrels,  331-338 

Proportions  or  percentages,  15,  47n.,  98,  99,  250, 
251,  255,  256,  259,  397,  410,  421,  424 
ASN    curve   for   sequential    one-sided    test    of, 
figures  showing,  240,  242 
formula  for,  427,  428 
confidence  limits  for  (see  Proportions,  popula- 
tion) 
estimation  of  (see  Proportions,  population) 
expected  value  of,  397,  410 

experiments    for    comparing    two,    analysis    of 
data  from,  249-258,  424,  425 
approximate  single-sample  method,  based 
on  binomial  test,  249,  250,  255 
based  on  Xc  253-255,  246-257 
based    on    inverse-sine    transformation, 

257,  258,  263,  264 
based  on  normal  deviates,  253-257,  263 
"exact"  single-sample  method,  250-253 
by    Fulcher-Zubin     Item    Analyzer,     257, 

257n.,  258 
incorrect  method  for,  256 
nomograph  for,  Zubin's,  references  to,  398, 

416 
by  sequential  analysis,  424,  425 
design  of,  258-265,  424,  426-428 
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Proportions  or  percentages,  experiments  for  com- 
paring two,  inverse-sine  transformation  of 
data  from,.  257,  258,  263,  264,  397-408 
number  of  observations  needed,  with  sequen- 
tial sampling,  424,  427,  428 
with  single  samples,  258-261,  264 
formula  for,  258,  259,  264 
nomogram  for,  260 
operating  characteristics  of,  261 
operating-characteristic  curve  (OC  curve)  of, 
determination,  by  formula,  262 

from  nomogram,  263 
figure  showing,  262 
operating-characteristic  surface  (OC  surface) 

of,  261,  262 
planning  of,  258-265,  424 
sequential,  424,  425,  426-428 
experiments  for  estimating  population  or  true 

(see  Proportions,  population) 
experiments   for   testing   a   single,    analysis   of 
data  from,  260n.,  263,  264,  421,  422 
ASN  curves  for  double,  multiple,  and  sequen- 
tial sampling  of,  240,  242 
design  of,  259,  260n.,  261-264,  424,  426-428 
inverse-sine    transformation    of    data    from, 

397-408 
number  of  observations  needed,  with  double 
sampling,  237-244 
figures  showing,  240-243 
with  multiple  sampling,  237-244 

figures  showing,  240-243 
with    sequential    sampling,    26,    237-244, 
419,  420,  427,  428 
figures  showing,  240-243 
formula  for,  427,  428 
with  single  sampling,   14,   15,  26-28,  259, 
260n.,  263,  264 
examples,  26 

formula  for,  259,  260n.,  263,  264 
nomogram  for,  use  of,  260n. 
OC    curve    of,    approximate    determination 
from  formula,  260n.,  262 
from  nomogram,  260n.,  263 
exact    determination   of   points   on,   using 
confidence-belt  charts,  336n. 
using   percentage   points   of   incomplete 
beta  function,  334,  336n. 
planning  of,  259,  260n.,  261-264,  426-428 
sequential,  421,  422,  427,  428 
one-decision  test,  358 
moments  of,  410 

population,  15,  47n.,  98,  99,  250,  251,  421 
bias  of  estimation  from  <p(p)  and  <pb(p),  403, 
404 
tables  showing,  400-402 
estimation  of,  confidence  belts  for,  332-335 
confidence   intervals   for,   based    on   attri- 
butes inspection,  332-336 
based    on    variables    inspection,    42-45, 
336n. 
confidence  limits  for,  evaluation  from  per- 
centage   points    of    incomplete    beta 
function,  333,  334,  337 
formula  for,  335,  336 


Proportions     or    percentages,     population,     esti- 
mation  of,   experiments  for,  analysis  of 
data  from,  332-336 
experiments  for,  design  of,  238,  333-337 
inverse-sine      transformation      of     data 

from,  336 
number  of  observations  needed,  333-337 
sequential,  238n. 
included      between     statistical     tolerance 
limits,  47n.,  98-99 
standard  deviation  of,  255,  397,  410 
tests  of  significance  regarding  (see  Proportions, 

experiments  for .  .  .  . ) 
transformation   of,   to  stabilize   variance,   257, 
258,  263,  264,  397-408 
(see  Inverse-sine  transformation) 
variance  of,  397,  410 
Protection  afforded  by  sampling  inspection,   46, 

47,  156-158 
Przyborowski,  J.,  46n.,  93,  255n.,  265 
Psychological  considerations,  278 


Quality,  acceptable,  61,  228,  229,  269,  278,  279, 
423 
(See  also  Acceptable-quality  level) 
of  accepted  lot  or  product,  16,  41,  42,  46,  156- 

158 
attainable,  46,  47 
bad  (see  Quality,  unacceptable) 
choice  of  standards  of,  45-49 
control  of  (see  Quality  control) 

(See  also  Control  charts;  Control  limits) 
cost  of,  OC  curve  a  determining  factor  in,  46-47 
desired,  45,  46 
evaluation    of    (see    Quality,    of    lot;    Quality, 

produced  by  process) 
feasible,  45-47 

good  (see  Quality,  acceptable) 
high,  42,  43 
improvement  of,  46 
information  about,  42 

provided  by  process  average  percent  defec- 
tive, 43 
by  sampling  inspection,   156-158 
by  statistical  tolerance  limits,  47,  97-100 
of  an  item,  7 

of  lot,  determined  by  average  quality  of  items 
of  lot,  8,  27-28,  61-63 
by  dispersion  of  quality  of  items  of  lot,  8, 

27,  28,  229,  269,  270,  278,  279,  280,  283 
by  fraction  defective,  7,  8 

combinations  of  /z  and  a  that  correspond 
to  same  fraction  defective,  11 
by  fraction   of   averages   of  samples   of  n 

that  exceed  specified  limit,  48 
by  fraction  of  ranges  Rk  in  samples  of  k 
that  exceed  specified  limit,  228,  229 

formulation  in  terms  of  <r,  229-231 
by  mean  quality  (n)  in  conjunction  with 

dispersion  of  quality  (a),  11 
by  mean  quality  (/i)  of  items  of  lot,  8,  27, 

28,  61-63 
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Quality,    of   lot,   determined   by    several  quality 

characteristics,  50-52,  113,  114 

by  standard  deviation   (<r)   of  qualities  of 

items   of   lot,    8,    27n.,    28n.,    229,    269, 

270,  278,  279,  280,  283 

by    upper    and    lower    limits    on    a    single 

quality  characteristic,  52-56,  63,  64 
by  variability  of  quality  among  items  of 
lot,  8,  27,  28,  229,  269,  270,  278-280,  283 
evaluation     of     (see     Confidence     intervals; 
Confidence    limits;    Estimation;    Toler- 
ance limits) 
judgment  of,  from  a  sample  or  samples  (see 
Acceptance     criterion;     Acceptance    in- 
spection) 
submitted,  16,  41,  42,  46,  156-158 
low,  42,  43 
maintenance  of,  43n. 
measures  of  (see  Quality,  of  lot) 
of  outgoing  product,  16,  41,  42,  46,  156-158 
poor,  42,  43 

produced  by  process,   evaluation  of   (see  Con- 
fidence intervals;  Confidence  limits;   Esti- 
mation; Process  average  percent  defective; 
Tolerance  limits) 
information     about,     provided     by     process 
average  percent  defective,  43 
by  sampling  inspection,  156-158 
by  statistical  tolerance  limits,  47,  97-100 
of  a  sample,  determined  by  sample  average,  48 

by  sample  range,  228,  229 
submitted,  16,  41,  42,  46,  156-158 
unacceptable,  45n.,  47-49,  229 
Quality  control,  applied  to  air  testing  of  bomb- 
sights,  113-184 
contrasted    with    acceptance    inspection,     117, 

156-158 
feature,    in    procedure    for    going    on    reduced 

inspection,  43n. 
functions  of,  117 
importance  of  x  and  s  in,  34 
multivariate,  113-184 

use  of  a  single  "score"  or  "index"  in,  113, 
114 
based  on  generalized  Student  T,  114-116 
advantages  of,  115 
objectives  of,  156,  157 

(See  also  Control  charts;  Control  limits) 


Rk:  acceptance  number  for  the  &*    sample,  237 

CSee  also  Rejection  number) 
Rk:  range  of  a  sample  of  k  items,  229 

(See  also  Range) 
r:  observed  product-moment  coefficient  of  correla- 
tion, 413 
2-transformation     of,     to     stabilize     variance, 
derivation,  413 
Radio  proximity  fuze,  341 

analysis  of  test  data,  method  of,  342-344 
design  of  tests  of,  optimum,  345-352 
Random  selection,  of  bombsights,  reasons  for,  158 


Random  variation,  components  of,  269,  297,  310 
311 
estimation  of,  and  tests  of  hypotheses  regard- 
ing, 311-314 
Randomization,  methods  of,  176,  177 
as  precautionary  measure,  118 

to  prevent  bias  from  time  trends,  371 
restricted,  177 

role  of,  in  design  of  experiments,  141,  142,  176, 
177 
Range,  calculation  of,  227 
definition  of,  227 

distribution    of,    in    samples    from    a    normal 
population,  234 
cumulative,  229 

figure  showing,  230 
tables  of,  reference  to,  229,  234 
mean  of,  table,  reference  to,  227n. 
standard  deviation  of,  227?i. 
as  estimator  of  a,  227 

efficiency  of,  in  small  samples  from  a  normal 
population,  227n. 
as  measure  of  dispersion  of  sample,  227 

reasons  for  use,  227 
in  samples  of  k,  primary  characterization  of  lot 
quality,  227,  228 
acceptance  inspection  when,  227-234 
Range    of    normal    variation,    97 

(See  also  Tolerance  limits) 
Rational  sentencing  of  lots,  156,  157 
Rectangular  distribution,  195,  217,  218 

as    representation    of    grouping    or    rounding 
errors,  195 
for  tabular  entries,  215-218 
semi-invariants  of,  195,  196n. 
standard  deviation  of,  218 
use  of,  in  checking  sums  and  tables,  example, 

218 
variance  of,  195,  218 
Reduced  inspection,  41-45 

criterion  for  changing  to,   when  lot  quality  is 
measured  by  fraction  defective,  42 
procedure   for   judging   whether   criterion   is 
met,  using  variables  data,  42-45 
when  upper  and  lower  limits  are  imposed 
on  a  single  quality  characteristic,  56 
effect  of,  on  OC  curve,  45 
methods  of  achieving,  with  variables  inspection 

for  fraction  defective,  45 
purpose  of,  41,  42 
Redundant  requirements  in  specifications,  50n. 
Rees,  Mina,  x 

Regression  analysis,  when  cost  of  each  measure- 
ment depends  upon  its  magnitude,  319-328 
when  dependent  variable  is  expressed  in  probits, 
342-344,  349-352 
Rejection  criterion,  for  sequential  likelihood  ratio 
tests,  420,  421 
for  sequential  variables  inspection  for  fraction 

defective,  83,  85 
for  single-sample  tests  (see  Acceptance  criterion) 
(See  also  Rejection  line;  Rejection  number) 
Rejection  line,  232,  357-360,  422,  423,  425 

adjusting,  to  allow  for  acceptance  line,  358-360 
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Rejection  line,  figure  showing,  357 

for  number  of  "successes"  in  binomial  test,  422 
for  number  of  "success-failure"  pairs,  in  double- 
dichotomy  test,  425 
for  one-decision  binomial  test,  358 
for  sum  of  observations,  in  test  for  n,  a  known, 

423 
for  sum  of  squared  observations,  in  test  for  a,  n 
unknown,  232 
Rejection  number   (Rk,   Si),   237,   239,   244,   358, 
422,  425 
definition  of  (Rk),  237 
in  double  sampling,  237 

examples,  239,  244 
in  multiple  sampling,  237 

examples,  239,  244 
relation   to    acceptance   number    (Ak)    charac- 
terizes type  of  sampling  plan,  237 
in  sequential  sampling  (Rk),  237 
in  sequential  tests  (Si),  422,  425 
in  sequential  tests  that  provide  for   rejection 

only,  358 
in  single  sampling,  237,  260n.,  263,  264 
Rejection  region,    boundaries    of,    for    sequential 
tests  (see  Rejection  line) 
for  single-sample  tests  (see  Acceptance  region, 
boundaries  of) 
Relationship,  determining,  when  cost  of  each  meas- 
urement depends  upon  its  magnitude,  319-328 
Replication,  dual  role  of,  in  experimentation,  159 
p:   product-moment   coefficient   of   correlation  in 

normal  bivariate  population,  413 
p:  ratio  of  population  variances  (<ri2/ao2),  271 
p(a,j8,ni),  definition  of,  271 

large-sample  formula  for,  278n. 
tables  of,  272,  273,  276,  277 
use  of,  examples  of,  271,  274,  275,  278 
p'  (a,/3,ni),  definition  of,  279 
relation  to  p(a,/S,m),  279 
use  of,  example  of,  280 
Rider,  Paul  R.,  386rc.,  394 
Rietz,  Henry  Lewis,  196n.,  197rc.,  218n.,  223 
Risks,  in  use  of  sampling  inspection,  156,  157 

(See   also   Consumer's   risk;   Operating-char- 
acteristic curves;  Producer's  risk) 
Robbins,  Herbert,  109 
Rodermund,  Edythe  B.,  x 
Romig,  Harry  G.,  27n.,  93 

Rounded  numbers,  impossibility  of  avoiding  use 
of,  188,  189 
measurements  as,  187—189 
scale  readings  as,  188,  189 
tabular  entries  as,  191,  215-218,  221 
Rounding,  187-223 

accepted  procedure  for,  189,  190 

(See  also  Rounding  Rule) 
Cayley's  procedure  for,  191 
effect  of,  on  descriptive  statistics,  187,  188 
on  distribution   of  error   of  an  interpolated 
value,  221 
of  F,  214,  215 
of  s2,  198-200,  203-214 
of  Student's  t,  200-203,  214,  215 
of  x,  196,  198,  212,  213 


Rounding,  effect  of,  on   moments  of  probability 
distributions,  193-196 
representation      as      effect      of      random 
"errors,"  195 
on  standard  procedures  for  statistical  infer- 
ence, 191-215 
impossibility  of  avoiding,  188,  189 
reasons  for,  187-189 
repeated,  consequences  of,  190,  191 

(See  also  Rounding  interval;  Rotmding  lat- 
tice;   Rounding    rule;    Sheppard's    correc- 
tions) 
Rounding  interval,  recommended  size  of,  192,  214 

reasons  for,  214 
Rounding  lattice,  effect  of  position  of,    on    dis- 
tribution of  s2,  206-213 
on  distribution  of  x,  212,  213 
random  location  of,  194 

extends  validity  of  Sheppard's    corrections, 

196 
procedure  for,  when  only  non-negative  values 
occur  in  population,  196 
Rounding  rule,  189,  190 
Cayley's  form  of,  191 

effect  of  repeated  applications  of,  190,  191 
elementary  properties  of,  190,  191 
importance  of  adhering  to,  190 
Roy,  S.  N.,  184 
Royalties,  vii 


S 


S:  sensitivity  of  a  fuze,  342 

s:  sample  estimate  of  population  standard  devia- 
tion (trt,  based  on  sum  of  squared  devia- 
tions from  sample  mean,  12, 13, 162, 192 
adjustment  of,  58 

advantages  of,  relative  to  mean  deviation,  162, 
163 
relative  to  sample  range,  227n. 
association  of  values  of,  with  values  of  x,   when 

data  are  grouped  or  rounded,  187,  188 
bias  of,  58 

adjustment  factor,  formula  for,  58 

tables  of,  references  to,  58n. 
when   computed   from   grouped    or   rounded 
data  (see  s2) 
combining  values  of,  to  estimate  variability  of 

process,  42-43 
computation  of,  12,  13,  33,  34,  42,  43,  162,  192 
N  -  1  versus  N  in,  13,  58 
naive  procedure  for,  criticism  of,  33 
using  modern  computing  machines,  33,  34 
definition  of,  12,  13,  162,  192 
N  -  1  versus  N  in,  13,  58 
degrees  of  freedom  of,  12,  13,  43 
(See  also  s2,  degrees  of  freedom  of) 
division  by,  in  computing,  12,  13,  43 
as  denominator  of  Student's  t,  72,  200 

Sheppard's  corrections  not  to  be  used  for,  203 
design  of  experiments  for  testing,  269-296 
distribution  of,  approximately  normal  for  large 
N,  58 
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s:  sample  estimate  of  population  standard  devia- 
tion  (<r),  distribution  of,   mean   of,   for- 
mula for,  58 
tables  of,  references  to,  58n. 
standard  deviation  of,  formula  for,  58 

tables  of,  references  to,  58n. 
(See  also  s2,  distribution  of) 
effect  of  grouping  or  rounding  of  data  on  (see  s2) 
efficiency  of,  162,  227 

expected  value  of,  in  normal  samples,  formula 
for,  58 
tables  of,  references  to,  58n. 
formula  for,  12,  13,  43,  162,  192 
grouping  of  data,  effect  on,  187,  188  (See  also  s2) 
homogeneity  of  two  or  more  values  of,  385-389 
importance  of  distinguishing  from  <x,  12 
importance  of,  in  quality  control,  34,  42-44 
largest  of  several  values  of,  testing  of,  383-394 
logarithmic     transformation     of,     to     stabilize 

variance,  412,  413 
mean  of,  in  normal  samples,  formula  for,  58 

tables  of,  references  to,  58n. 
multiple  of,  for  setting  tolerance  limits  (see  K) 
used    in   variables  inspection  for   fraction 
defective  (see  k) 
iV  —  1   versus   N,  in  definition  and   computa- 
tion of,  13,  58 
standard    deviation    of,     in    normal    samples, 
formula  for,  58 
tables  of,  references  to,  58n. 
termed      "sample     standard     deviation"     for 

brevity,  12,  13 
tests  of  significance  regarding  (see  s2) 
transformation   of,   to   stabilize   variance,   412, 

413 
use  of,  in  acceptance  inspection  for  a,  231,  232, 
270,  271,  278-280 
in    acceptance    inspection    when    ranges    in 
samples  of  k  determine  lot  quality,  227- 
234 
in  computing  control  limits  for  means,  147, 

148 
in  estimating  process  average  percent  defec- 
tive, 41-44 
opposition  to,  rebuttal  of,  33,  34 
in  quality  control,  34,  42-44,  151,  152 
in  setting  tolerance  limits,  47 
in  variables  inspection  for  fraction  defective, 
9-14,  33,  34,  41-44,  48,  53,  56,  58-64,  71- 
73,  83-85 
s:  sample  estimate  of  population  standard  devia- 
tion (<r),  based  on  cumulative  distribution 
(probit  data),  343,  350 
variance  of,  351 

with  optimum  experimental  design,  352 
«:  slope  of  parallel  lines   in   graphical  sequential 
analysis,  232,  233,  357,  358,  426,  427 
formula  for,  binomial  test,  one-decision,  358 
binomial  test,  one-sided,  422 
double-dichotomy  test,  425 
mean  test,  a  known,  423 
standard  deviation  test,  n  unknown,  233 
use  in  constructing  ASN  curve,  233 
five-point,  427 


s:  slope  of  parallel  lines,  in  graphical  sequential 
analysis,  use  in  constructing  OC  curve, 
233 
five-point,  428 
se:  observed  number  of  "successes"  in  N  trials 

with  experimental  method,  262 
ss:  observed  number  of  "successes"  in  N  trials 

with  standard  method,  262 
s2:  sample  estimate  of  population  variance  (o-2), 
12-13,  43,  162,  198 
bias    of,    when    computed    from    grouped    or 

rounded  data,  198-200,  203-214 
computation  of,  12,  13,  33,  34,  42,  43,  162,  198 
control  limits  for,  151 
correlation   of,    with   x,    when   computed   from 

grouped  or  rounded  data,  187,  201 
definition  of,  12,  13,  43,  162,  198 
degrees  of  freedom  of,  12,  13,  43,  199,  203,  270, 

312,  387,  388 
design    of    experiments    for    determining    and 

testing,  269-296 
distribution  of,   mean  of,  in  sample  from  arbi- 
trary grouped  or  rounded  populations,  199 
in  large  samples  from  arbitrary  population, 
199 
effect  of  grouping  or  rounding  on,  198-200 
in  samples  from  a  normal  population,   199, 
270 
effect  of  grouping  or  rounding  on,  in  large 
samples,  198-200 
in  small  samples,  203-214 
variance     of,     in     samples    from    arbitrary, 
grouped,  and  normal  populations,  199 
effect  of  grouping  or  rounding  of  data  on,  187, 

188,  198-200,  203-214 
efficiency  of,  162 
expected  value  of,  in  samples  from  arbitrary, 

grouped,  and  normal  populations,  199 
formula  for,  12,  13,  43,  162,  198 
grouping  of  data,  effect  on,  187,  188,  198-200, 

203-214 
homogeneity  of  two  or  more  values  of,  testing 

of,  385-389 
importance  of  distinguishing  from  a2,  12 
independent    of    x    in    samples    from    normal 
population,  200 
effect  of  grouping  or  rounding  on,  187,  188, 
201 
largest  of  several  values  of,  testing,  383-394 
logarithmic     transformation     of,     to     stabilize 

variance,  412,  413 
mean  of,  in  samples  from  arbitrary,  grouped, 

and  normal  populations,  199 
moments  of,  in  samples  from  a  normal  popula- 
tion, 412 
tests  of  significance  regarding,  sequential,  232, 
233,  426 
single  sample,  231,  232,  269-296,  383-394 
transformation  of,  to  stabilize  variance,  412,  413 
uses  of,  special  (see  s) 

variance  of,  in  samples  from  arbitrary  popula- 
tion, 199 

when  data  are  grouped  or  rounded,  199 
in  samples  from  normal  population,  199 
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s2:  sample  estimate  of  population  variance  (a2), 
variance  of,  in  samples  from  normal  popula- 
tion, when  data  are  grouped  or  rounded,  211 
yardsticks  for,  152 
Salzer,  H.  E.,  109,  416 
Sample  average  (see  Sample  mean) 
as  basis  for  decisions,  156-158 

(See  also  Consumer's  risk;   Operating   char- 
acteristics; Operating-characteristic  curves; 
Producer's  risk) 
mean  (see  Sample  mean) 
median  (see  Median) 
random,  158-160 
representative,  160 
selection  of,  158-160 
size  of  (see  Sample  size) 
Sample  mean  (x),  12,  43,  100,  192 

advantages  of,   relative  to  sample  median,  in 

normal  samples,  161,  162 
bias  of,  when  data  are  grouped  or  rounded,  187, 

188,  197,  212 
combining,  from  several  samples,  in  estimating 

process  average  percent  defective,  42,  43 
computation  of,  12,  43,  100,  192 
control  limits  for,  147-150 
correlation    of,    with    s    or    s2,    when   data   are 

grouped  or  rounded,  187,  188,  201 
definition  of,  12,  43,  100,  192 
as    descriptive   statistic,   effect  of  grouping  or 

rounding  on,  187-188 
design  of  experiments  for  testing,  17,  27n.,  28n., 

60-66,  81,  82,  377-382,  423 
distribution  of,  for  arbitrary  population,   197, 
198 
effect  of  grouping  or  rounding  on,  196-198, 
203 
for  normal  population,  162 
effect  of  grouping  or  rounding  on,  187,  188,  196, 

198,  212,  213 
efficiency  of,  in  normal  samples,  162 

when  data  are  grouped  or  rounded,  197 
as  estimator  of  population  mean   (n),  arbitrary 
population,  197,  198 
when  data  are  grouped  or   rounded,    196- 
198,  212,  213 
normal  population,  162 
(See  also  Confidence  limits,  for  fi) 
expected  value  of,  197,  198,  212 
importance  of  distinguishing  from  population 

mean  (n),  12 
mean  of,  197,  198,  212 
moments  of,  in  samples  from  arbitrary,  grouped, 

and  normal  populations,  197,  198 
use  in  quality  control,  34,  42-44,  147-150 
in  variables  inspection  for  fraction  defective, 
9-14,    33,    34,    41-44,    48,    53,    56,    58-64, 
71-73,  83-85 
variance  of,  162,  197,  198 
yardsticks  for,  150 
Sample  mean  deviation  (see  Mean  deviation) 
Sample  median  (see  Median) 

Sample   size,    adjustment   of,   by   modifying   OC 
curve,  19 


Sample  size,  average,  233,  238,  420,  427 
(See  also  ASN  curve) 
changes  of,  effect  on  OC  curve,  21,  45 
for  each  of  two  samples  (see  N,  second  listing) 
for  single  sample  (see  N,  first  listing) 

(See  also  Amount   of  inspection;   Computa- 
tion   of   number   of    observations   needed; 
Formula,     for     number     of     observations 
needed;  Na,  Nm,  N*,  Nt) 
Sample  standard  deviation,  12,  13,  187n.,  1887?. 

(See  also  s) 
Samples,  combined,  42,  43 
Sampling,  biased,  160 
curtailed,  14,  238,  361 

random,  role  of,  in  experimentation,  158-160 
representative,  likelihood  of  bias  of,  160 
Sampling  inspection,  aims  of,  156-158 
control  features  of,  42,  43,  157 
effectiveness  of,  156-158 

(See  also  Operating  characteristics;  Operat- 
ing-characteristic curves;   Operating-char- 
acteristic surface) 
objectives  of,  156-158 
risks  involved  in,  228,  229 

(See  also  Consumer's  risk;  Producer's  risk) 
steps  in,  13,  14 

value  of,  dependent  upon  quality  of  submitted 
product,  42 
(See  also  Acceptance  inspection;  Inspection) 
Sampling-inspection  plans,  attributes   (see  Attri- 
butes, inspection) 
changes  in,  41-45,  142-145 
classification  of,  237 

comparison  of,  26-28,  85,  237-244,  419,  420 
double,  237-244 
multiple,  237-244 
multivariate,  111-184 

sequential,  83-85,  232,  233,  358-362,  419-428 
single   sample,   7-82,    111-184,    231,    232,    237, 
249-264,  269-296,  331-338,  377-381,  385- 
393 
(See  also  Acceptance  inspection;  Inspection; 
Statistical  tests) 
Savage,  Leonard  J.,  ix,  xii,  238n.,  244,  354,  364 
Scale  readings,  188,  189 
Scheffe,  Henry,  109 
Schumpeter,  Joseph  A.,  145n.,  183 
Scores,  examination,  dispersion  of,  280 

use   of,   in   acceptance  inspection   and   quality 
control,    when   several   quality   character- 
istics are  involved,  50-52,   113-116 
Sealy,  E.  H.,  156,  157,  184 
quotation  from,  156,  157 
Semi-invariants,  of  rectangular  distribution,  195, 
196rc. 
Sheppard's  corrections  for,  196n.,  222 
Sensitivity  tests,  341,  342 

analysis  of  data  from,  342-345 
optimum  design  of,  345,  346,  349-352 
use  of  preliminary  data  in,  346-349 
Sequential  analysis,  26,  83-85,  232-234,  237-244, 
357-362,  367-372,  417-429 
advantages  of,  26,  367-371,  419,  420 
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Sequential  analysis,  compared  with  non-sequential 
analysis,  26,  85,  237-244,  367-372,  419,  420 
conditions  for  superiority  of,  over  non-sequen- 
tial analysis,  369,  371,  419,  420 
for    superiority    of    non-sequential    analysis, 
371,  420 
definition  of,  237,  367,  419 
disadvantages  of,  371 
elements  of,  417-429 
Sequential    design,    of    experiment,    for    locating 
maximum  of  a  function,  367-372 
when  failure  rate  of  equipment  is  high,  353- 
362 
Sequential  estimation,  of  parameter  of  binomial 

distribution,  reference  to,  238n. 
Sequential-sampling-inspection  plans,  83-85,  232, 
233,  358-362,  421-426 
(See  also  Statistical  tests,  sequential) 
acceptance    line,    in    graphical    procedure    for, 
232,  356,  357,  359,  360,  422,  423,  425 
(See  also  Acceptance  line) 
acceptance  number  in,  237,  422,  425 
amount  of  inspection  in  (see  Amount  of  inspec- 
tion, with  sequential  sampling) 
operating-characteristic  curves  of  (see  Operat- 
ing-characteristic curves) 
that  provides  for  rejection  only,  357-362 
rejection  line,  in  graphical  procedure,  232,  356, 
357,  359,  360,  422,  423,  425 
(See  also  Rejection  line) 
rejection  number  in,  237,  422,  425 
Severity  of  test,  cannot  be  controlled  precisely, 
inspection     for     fraction     defective     when, 
331-338 
Sheppard,  W.  F.,  188,  193,  196,  197,  199,  203,  223 
Sheppard's  corrections,  193-196 
bivariate,  references  to,  193n. 
conditions  for  strict  validity  of,  193 
consequences  when  not  fulfilled,  196 
implications  of,  194 
conditions  for  validity  "on  the  average,"  194- 

196 
when   grouping   lattice   is   located   at   random, 

194-196 
for  J-shaped  distributions,  194 

references  to,  194n.,  222 
multivariate,  references  to,  193n.,  223 
not  to  be  used,  in  computing  confidence  limits, 
199,  200,  203 
in  evaluating  tests  of  significance,   199,  200, 
203 
Shewhart,  Walter  A.,  47n.,  49n.,  58n.,  93,  97n., 

110,  221n.,  223 
Short  cuts,  in  analysis  of  2  X  2  table,  253-255 
o-:   standard   deviation   of  lot  of  population    (see 

Standard  deviation,  of  lot;  of  population) 
<r(N):    standard    deviation    of    number    of    items 
inspected,  238 
figures    showing,    for    double-,    multiple-,    and 
sequential-sampling  plans,  241,  243 
o-o :  standard  deviation  corresponding  to  standard 

method,  process,  or  product,  269 
a\ :     standard    deviation    corresponding    to    new 
method,  process,  or  product,  269 


<ri:    standard    deviation    defining    good    quality, 

when  lot  quality  is  measured  by  fraction  of 

sample    ranges   that    exceed    specified   limit, 

229-231 

ci:    standard  deviation  of  the  first  of  two  methods, 

processes,  or  products,  280 
ay.    standard    deviation    defining    bad    quality, 
when  lot  quality  is  measured  by  fraction  of 
sample   ranges   that    exceed   specified   limit, 
229-231 
ai:    standard    deviation    of    the    second    of    two 

methods,  processes,  or  products,  280 
Significance,    statistical   tests    of    (see    Statistical 

tests) 
Significance  levels,  of  Bartlett's  test,  approxima- 
tion to,  388 
tables  of,  reference  to,  388n.,  394 
of  xc,  approximation  to,  255 
tables  of,  reference  to,  255 
of  x2»  Cornish-Fisher  type  expansion  for,  refer- 
ence to,  109 
for  one  degree  of  freedom,  equivalent  to  Ka2, 

258n. 
relation  of,  to  partial  sum  of  Poisson  series, 
109 
to  significance  levels  of  F-distribution,  316 
tables  of,  references  to,  315n. 
of  Cochran's  test,  tables  of,  390,  391 
of  F,  Paulson's  approximation  to,  393,  394 
for  m  =   oo ,  equivalent  to  significance  levels 

of  x2,  316 
relation  of,  to  percentage  points  of  Fisher's  z, 
317n. 
of  incomplete  beta  function,  392,  393 
tables  of,  comments  on,  151n.,  152n.,  317n. 
references  to,  151n.,  184,  317n.,  318 
Hotelling's  T  (see  Significance  levels,  of  T) 
meaning  of,  270,  271,  281 

of  normal  distribution,  relation  of,  to  probits, 
343 
short  table  of,  18 

tables  of,  references  to,  10n.,  18,  343n. 
need  for  care  in  nsing,  lOn. 
of  range,  in  samples  of  k  from  normal  distribu- 
tion, curves  showing,  230 
table  of,  reference  to,  229n.,  234 
of  T,  asymptotic  expansion  for,  155 

relation  of,  to  significance  levels  of  x2,  155 
of  F,  155 
of  t,  asymptotic  expansion  for,  148,  149n.,  184 

tables  of,  references  to,  148n. 
of  z,  relation  to  significance  levels  of  F,  317n. 
Significant  differences  (see  Statistical  tests) 
Significant  figures,  189 
definition  of,  189n. 

recording  excessive  number  of,  191,  192 
too  few,  192 
Simaika,  J.  B.,  316n.,  318 
Simon,  Herbert  A.,  381,  382 

Simultaneity  of  events,  acceptance  inspection  for, 
227-234 
formulation  in  terms  of  <r,  229-231 
measured  by  fraction  of  sample  ranges  that  are 
less  than  specified  limit,  227,  228 
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Single-sampling-inspection  plans,  7-82,   111-184, 
231,  232,  237,  249-264,  269-296,  331-338, 
377-381,  385-393 
(See  also  Statistical  tests,  single-sample) 
acceptance  number  in,  237,  260n.,  263,  264 
amount  of  inspection  (see  Amount  of  inspection, 

with  single  sample) 
curtailment  of,  14,  238 

operating     characteristics     of     (see    Operating 
characteristics ;       Operating-characteristic 
curves;  Operating-characteristic  surface) 
rejection  number  in,  237,  260n.,  263-274 
Skewness,    of   distribution   of   x,    when   data   are 
grouped, 198 
measures  of  (a//?, yi),  197,  198 
Slope,  of  linear  regression  function  (/3),  322 

determination  of,  when  cost  of  each  measure- 
ment depends  upon  its  magnitude,  319-328 
when    dependent    variable    is    expressed    in 
probits,  343,  350 
references  to,  343n.,  352 
relation  of,  to  a,  343 
two-point  estimate  of,  formula  for,  350 
Small  samples,  effect  of  grouping  or  rounding  on 
statistical  analyses  of,  203-215 
efficiency  of,  as  basis  for  decisions,  156,  157 
Smith,  B.  Babington,  120n.,  184 
Snedecor,  George  W.,  405,  415 
Solomon,  Herbert,  ix,  384 
Specifications,  inconsistent,  50n., 

redundant,  50n.,  51n. 
Sponsorship,  vii 
Square-root  transformation,  411, 


51n. 


414 


Stabilizing  variance,  by  transformation  of  data, 

408-414 
Standard;  new  method,  process,  or  product  com- 
pared with,  with  respect  to  fraction  defective, 
7-93,  27n.,  28n.,  231-234,  237-244,  270,  271, 
274,  275,  278-280,  423 
with  respect  to  level  of  performance,  17,  27n., 

28n.,  59-63,  81,  82,  423 
with    respect    to    variability    of    performance, 
227-234,  270,  271,  274,  275,  278-280 
(See  also  Old  method) 
Standard  deviation,  of  binomial  distribution,  255, 
397 
of  correlation  coefficient  (see  Variance) 
of  lot  (a),  as  measure  of  lot  quality,  27n.,  28n., 
229-231,  269-296 
(See  also  Standard  deviation,  of  population) 
of  mean  (see  Variance) 
of  mean  deviation  (see  Variance) 
of  median  (see  Variance) 

of  normal  distribution  (<r),  9,   12,  58,  97,   161, 
229,  342,  378,  423,  426 
bias  of  s  as  estimator  of,  58 

adjustment  factor,  formula  for,  58 
tables  of,  references  to,  58n. 
as  parameter  of,  12,  161 

as  reciprocal  of  slope  of  regression  line,  when 
dependent  variable  is  expressed  in  probits, 
343 
of  number  of  items  inspected  (<r(N)),  238 


Standard  deviation,  of  number  of  items  inspected 
(o-(JV)),   figures  showing,  for  double-,   multi- 
ple-, and  sequential-sampling  plans,  241,  243 
of  Poisson  distribution   (see  Variance) 
pooled  estimate  of,  43 

of  population  (a),  bias  of  s  as  estimator  of,  in 
normal  samples,  58 
confidence  limits  for  (see  Variance) 
estimation  of,  from  mean  deviation,  162,  163, 
169 
from  probit  data,  342-352 
from  range,  227,  227n. 
from  s,  in  normal  samples,  58 
from  sensitivity  tests,  342-352 
(See  also  Variance) 
importance  of  distinguishing  from  s,  12 
as  parameter  of  normal  distribution,  12,    161, 
tests  of  significance  relating  to,  sequential, 
232,  233,  426 
single-sample,  231,  232,  269-296,  385-394 
of  proportion  or  percentage,  255,  397 
of  r  (see  Variance) 
of  range,  227n. 

of  rectangular  distribution,  218 
of  s,  in  normal  samples,  formula  for,  58 
of  sample,  12,  13,  187n.,  188n. 
Standard  error  (see  Standard  deviation) 
Standard  method,  process,  or  product,  compared 

with  a  new  (see  Old  method) 
Stanford  University,  ix,  xi,  xii,  96,  384 
Statistical  hypothesis,  alternative,  419,   420 
null,  420 

under  test,  419,  420 
Statistical  Research  Group,  iii,  ix-xi,  6,  8n.,  20n., 
26n.,  34n.,  42rc.,  52n.,  83n.,  85n.,  93,   101n., 
108n.,    109n.,    143n.,    184,    188,    231n.,    234, 
237n.,  244,  257n.,  260n.,  335n.,  358n.,  361n., 
362,  400,  405n.,  415,  419n.,  420n.,  428,  429 
senior  scientific  staff  of,  list  of  names,  ix,  x 
Statistical  tests,   analysis  of  variance,   312-314, 
385,  386 
for  components  of  variance,  312-314 
for  homogeneity  of  means,  385,  386 
Bartlett's,   for   homogeneity    of   variance   esti- 
mates, 183,  386-388 
based    on   sign    of    difference   between   sample 
means,  377-382 
conditions  rendering  optimal,  381,  382 
bias  of,  if  Sheppard's  corrections  are  used,  199, 

200,  203 
binomial  distribution,  based  on,  249,  250,  255 
for  comparing  parameters  of  two,  249-258, 

424,  425 
for  testing  parameter  of  a  single,  260n.,  263, 
264,  421,  422 
with  provision  for  rejection  only,  358 
Xc-test,  254-257 

X2-test,  for  deciding  from  si2,  whether  o-i2  >  <r02, 
270-278 
for    deciding    from    si2,    whether    <ri2  <  tr02, 

278-280 
for  deciding  from  "sum  of  squares"  whether 

a2  >  <ri2,  231,  232 
for  homogeneity  of  proportions,  256,  257 
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Statistical  tests    x2-test,  for  homogeneity  of  pro- 
portions, of  2  X  2  table,  254,  255 
of  variance  estimates,  386-388 
Cochran's,  for  significance  of  largest  variance 

estimate,  383-394 
for  comparing  two  binomial  distributions,  249- 
258,  263,  264,  424,  425 
two  means,  61,  62 
two    percentages    or    proportions,     249-258 

263,  264,  424,  425 

two  standard-deviation  or  variance  estimates, 
280-297 
for  components  of  variance,  312-314 
curtailed,  14,  238,  361 

for  determining  whether  new  method,  process, 
or  product  exceeds  a  standard,  with  respect 
to  fraction  defective,  7-93,  27n.,  28n., 
227-234,  237-244,  270,  271,  274,  275,  278- 
280,  421-423 
with    respect    to    level    of    performance,    17 

27n.,  28n.,  59-63,  81,  82 
with  respect  to  variability  of  performance, 
227-234,  270,  271,  274,  275,  278-280 
for  determining  whether  a  particular  one  of  two 
methods,  processes,  or  products  exceeds  the 
other,  with  respect  to   fraction  defective, 
249-258,  263,  264,  424,  425 
with  respect  to  level  of  performance,  59-63, 

380 
with  respect  to   variability  of  performance, 
280,  282,  283,  296 
for  difference  of  means,  59-63,  380 

of  percentages  or  proportions,  249-258,  263, 

264,  424,  425 

for  double  dichotomy,  249-258,  424,  425 
double-sampling,  for  fraction  defective,  237-244 
effect  of  grouping  or  rounding  on,   with  large 
samples,  191-203 
with  small  samples,  203-215 
errors  of  the  first  and  second  kinds  associated 

with,  271,  281,  282 
F-test,  in  analysis  of  variance,  311-314 
for  components  of  variance,  311-314 
for  homogeneity  of  means,  385-386 
of   ratio   of   standard   deviation   or   variance 
estimates,  280-296 
for  four-fold  table,  249-258,  424,  425 
for  fraction  defective,  based   on    enumeration 
data,   237-244,   249-258,   260n.,   263,   264, 
421,  422,  424,  425 
based    on    measurements,    7-93,    27n.,    28n., 
227-234,  270,  271,  274,  275.  278-280,  423 
general  one-parameter,  sequential,  426-428 

single  sample,  61,  62 
generalized  Student's  ratio  (see  Hotelling's  T) 
for  generalized  variance,  reference  to,  166 
graphical,  34-37,  52-56,  66-71,  232,  233,  260, 

356-359,  398,  414n.,  416,  422,  423,  425 
grouping,   effect   of,    on,    with    large    samples, 
191-203 
with  small  samples,  203,  215 
for  homogeneity,  of  means,  61,  62,  385,  386 
of  percentages  or  proportions,  249-258,  263, 
264,  424,  425 


Statistical     tests,     for     homogeneity,     of     2X2 
table,  249-258 
of  variance  estimates,  385-389 
Hotelling's  T,  114-116,  152-156 
indifference  value  in,  282 

for  mean,  comparison  of  two  or  more,  59-63, 

380,  385,  386 

test  of  a  single,  61-63,  71-73,  83-85,  197,  198, 

200-203,  212,  213,  423,  426 

mechanical  (see  Fulcher-Zubin  Item  Analyzer) 

multiple-sampling,  for  fraction  defective,  237  - 

244 
multivariate,  50-52,   114-116,   147-156 
number   of   observations   required    (see   N;    N; 
Na;  Nm;  Ns;  Nt) 
(See  also  Amount  of  inspection;  ASN  curve; 
Computation  of;  Formulas) 
omnibus,  386 

one-decision  sequential  binomial  test,  358 
one-sided  or  one-tail,  52,  57,  150,  152,  156,  426 
operating     characteristics     of     (see     Operating 
characteristics ;        Operating-characteristic 
curves;    Operating-characteristic    surface) 
for  order  of  population  means,  377-382 
planning  (see  Design  of  experiments) 
for  proportions  or  percentages,  comparison  of 
two,  249-258,  263,  264,  424,  425 
test  of  a  single,  237-244,  260n.,  263,  264,  421, 
422 
with  provision  for  rate  of  failure  of  equipment, 

355-362 
for  range  in  samples  of  k,  formulation  in  terms 

of  sample  "sum  of  squares,"  229-231 
for  ratio  of  two  standard-deviation  or  variance 

estimates,  280-296 
risks  associated  with,  271,  281,  282 

(See  also  Operating-characteristic  curves) 
sequential,  83-85,  232,  233,  358-362,  421-426 
for    sign     of    difference    between     population 

means,  377-382 
significance  levels  of  (see  Significance  levels) 
for  standard  deviations  or  variances,  compari- 
son of  two  or  more,  231-233,  269-296,  385- 
394,  426 
T-test,  Hotelling's,  113-116,  147-156 
t,  non-central,  71-73 

Student's,  72,  147-149,  200-203,  214,  215 
relation  of,  to  Student's  z-test,  200 
two-sided  or  two-tail,  52-57,  150,  152,  156,  426 
transformation  of  data  in,   257-259,   261-264, 

395-416 
truncated,  14,  238,  361 
for  variance  components,  312-314 
for  variances  (see  Statistical  tests,  for  standard 
deviations) 
Statistics,    sample,    contrasted    with    population 

parameters,  161 
Steffensen,  J.  F.,  221n.,  223 
Stigler,  George  J.,  ix 

Student   (William   Sealy  Gosset),   187,  200,   201, 
203,  223 
quotation  from,  187,  188 
Student's  t,  definition  of,  72,  200 
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Student's  t,  distribution  of,  effect  of  grouping  or 
rounding  on,  with  large  samples,  200-203 
with  small  samples,  187,  188,  214,  215 
percentage  points  of,  asymptotic  expansions  for, 
148,  149n.,  184 
tables  of,  references  to,  148n. 
relation  of,  to  Student's  z,  200 
Sheppard's  corrections  not  to  be  used  in  com- 
puting, 203 
Student's  z,  definition  of,  187n. 

effect    of    grouping    on,     187,     188     (See    also 

Student's  t) 
relation  of,  to  Student's  t,  200 
Systematic  variation,  310,  311 


T:  angular     difference     between     two     binomial 

probabilities  Pe  and  Pa,  262,  264 
T:  generic  symbol  for  an  estimator  of  a  parameter 

0,  408 
T:  Hotelling's  generalized  Student  ratio,  definition 
of,  142,  154,  155 
distribution  of,  154,  155 

asymptotic  expansion  for  percentage  points 

of,  155 
free  from  nuisance  parameters,  115 
relation  to  x2-distribution,  155 
to  F-distribution,  155 
optimum  properties  of,  147 

as  "score,"  combining  results  of  two  or  more 
tests,  113-116 
T:  Zubin's  symbol  for  a  transformed  proportion, 

405,  415 
T-statistics,  advantages  of,  115,  164 

applied  to  air  testing  of  bombsights,   122-142, 

164-168 
control  charts  for,  figures  showing,  132,  134,  136 
control  limits  for,  computation  of,  152—155 
definitions  of,  and  relationships,  131,  133,  153, 

154 
distributions  of,  152-155,  164 
uses  of,  115 

yardsticks  for,  computation  of,  156 
t:  angular  difference  between  two  observed  pro- 
portions ps  and  ps,  264 
t:  ratio  of  a  normally  distributed  variable  to  an 
independent     estimate     of     its     standard 
deviation,  72,  73 
(See  also  Student's  t;  Non-central  t) 
to:  Johnson  and  Welch  notation  for  a  given  level 

of  non-central  t,  73 
Tables,  checking  of,  x 

list  of  principal  reference,  xiii 
Tabular  entries  or  values,  effect  of  further  round- 
ing of,  191 
Fisher- Wishart  recommendation  regarding,  221 
statistical  properties  of,  215-218,  221 
Tchebycheff  inequality,  49,  221 
6:  angle  corresponding  to  an  observed  proportion, 

257,  258,  398,  400 
6:  generic  symbol  for  a  parameter  of  a  probability 

distribution,  408,  426 
0p:  value  of  0  when  null  hypothesis  is  true,  426 


0i :  value  of  0  when  alternative  hypothesis  is  true, 

426 
Thompson,  Catherine  M.,  110,  151n.,  152n.,  184, 
315-318,  333,  335n.,  336n.,  338,  388n.,  393, 
394 
Tightened  inspection,  41-45 

criterion  for  changing  to,   when  lot  quality  is 
measured  by  fraction  defective,  42 
procedure   for   judging   whether   criterion   is 
met,  using  variables  data,  42—45 
when  upper  and  lower  limits  are  imposed 
on  a  single  quality  characteristic,  56 
effect  of,  on  OC  curve,  45 
methods  of  achieving,  with  variables  inspection 

for  fraction  defective,  45 
purpose  of,  41,  42 
Time  to  rupture,  for  test  specimens,  321,  365 
determination  of,  when  cost  of  each  measure- 
ment depends  on  its  magnitude,  322-328 
determination    of    composition   yielding    maxi- 
mum, 365-372 
Time  trends,  effect  of,  on  statistical  analyses,  371, 

372 
Tippett,  L.  H.  C,  120n.,  184,  227n.,  234 
Tolerance    factor    (K),    for   normal    distribution, 
97-108 
definition  of,  97,  98 
formulas  for,  101,  108 
table  of  values  of,  102-107 
uses  of,  examples  of,  100,  101 
(See  also  Tolerance  limits) 
Tolerance  limits,  statistical,  47,  97-108 
conditions  for  satisfactory  use  of,  97 
contrasted  with  confidence  limits  for  popula- 
tion mean,  99,  100 
definition  of,  one-sided,  47,  98,  99 

two-sided,  97,  98 
distribution-free,  99,  109 
for  non-normal  distributions,  99,  109 
for  normal  distributions,  47,  97-99,  100-108 
one-sided,  47,  98,  99 

computation  of,  example,  47 
two-sided,  47n.,  97-99,  100-108 
examples  of,  100,  101 
factors  for,  table  of,  102-107 
formulas  for,  101,  108 
Transformation,  angular,  257-259,  261-264,  336, 
395-416 
general,  to  stabilize  variance,  408-410 
inverse  hyperbolic  tangent,  413 
inverse  sine,  257-259,  261-264,  336,  395-416 
logarithmic,  of  F,  317n. 
of  r,  413 
of  s  or  s2,  413 
probit,  343,  352 
square  root,  411,  414 
z-,  of  correlation  coefficient  (r),  413 
of  variance  ratio  (F),  317n. 
Transformed  correlation  (z),  413 
Transformed   percentage   or   proportion    (<p,T,6), 
257,  258,  398,  400,  405,  415 
(See  also  Inverse-sine  transformation) 
Trends,  effect  of,  on  statistical  analyses,  371,  372 
Tukey,  John  W.,  109,  414n, 
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2X2  table  (see  Double  dichotomy) 

Two-sided  criterion,  in    variables    inspection    for 

fraction  defective,  52-57,  63,  64 
Two-sided  or  two-tail  tests,  52-57,  150,  152,  156, 

426 
Two-sided  tolerance  limits,  47rc.,  97-99,  100-108 


U 


U:  normal  deviate  exceeded  with  probability  p, 

72 
U:  upper  limit  of  acceptable  ranges  in  sample  of  k, 

227,  228 
U:  upper  limit  of  a  measured  quality  characteris- 
tic, above  which  an  item  is  defective,  9-12 
(See  also  L) 
selection  of,  46,  47 
as  tolerance  limit,  47 
(/-shaped  distribution,  194,  222 
United  States  Government,  vii 

(See  also  Army;  Bureau  of  the  Census;  De- 
partment of  Agriculture;  National  Bureau 
of  Standards;  National  Defense  Research 
Committee;  Navy;  Office  of  Scientific 
Research  and  Development) 
Universe  (see  Lot;  Population) 
University,  Brown,  ix 

of  Chicago,  iii,  ix-xii,  6,  248,  320,  340,  364,  418 
Columbia,  iii,  vii,  ix-xii 

(See  also  Statistical  Research  Group) 
of  North  Carolina,  ix,  xi,  112,  248 
Princeton,  ix 

Stanford,  ix,  x,  xi,  96,  384 
of  Wisconsin,  ix,  354 
Upper  and  lower  limits  (see  Two-sided  criterion; 
Two-sided  tests;  Two-sided  tolerance  limits) 


Variables,  inspection  by,  7-93,  113-184,  227-234, 
269-296,  377-382,  385-393,  423,  426 
advantages  of,  7-9 
amount  of  inspection  in  (see  N;~N;  Nm]  Ns;  Nt) 

(See  also  Amount  of  inspection) 
conditions  for  advantageous  use  of,  9 
curtailment  of,  14,  15 
disadvantages  of,  8,  9 
for  fraction  defective,  7-93 

acceptance  criteria  in,  12,  14,  17,  33-41,  52- 
56,  67-71,  85 
(See  also  Computation,   of  acceptance  re- 
gion; Formula,  for  acceptance  criterion; 
Graphical  procedure) 
advantages  of,  7-9 

amount  of  inspection,   17,  21,  27,  28,  64-66, 
81,  82,  85 
(See   also   Amount   of   inspection;    N;    N; 
Nm;  N*;  Nt) 
condition  for  advantageous  use  of,  9 
contrasted   with  attributes  inspection,    7—9, 

26-28 
criteria,  use  of  pair  of  one-sided  in,  53,  54 

figure  showing,  55 
criterion,  use  of  single  two-sided,  52-57,  63,  64 


Variables,  inspection  by,  for  fraction  defective, 
criterion  for  changing  to  reduced  or  tight- 
ened inspection  in,  42 
formulas  for  (see  Formula) 
graphical  procedure  in  (see  Graphical  proce- 
dure) 
when  inspection  is  costly,  9 
when   lot    mean    (n)    is    known,    27n.,    28n., 

230-234, 270, 271,  274,  275,  278-280 
when  lot  standard  deviation    (a)   is  known, 

27,  28,  61,  62,  423 
operating    characteristics    of    (see   Operating 
characteristics ;        Operating-characteristic 
curves;  Operating-characteristic  surface) 
plan  of  action,  14,  16-21,  33-41 

choice  of,  45-49 
sequential,  83-85 
single  sample,  7-82 

when  upper  and  lower  limits  are  imposed  on 
a  single  quality  characteristic,  52-57,  63,  64 
for   fraction   of   ranges   in   samples   of    k    that 
exceed  specified  limit,  227-234 
formulation  in  terms  of  <r,  229-231 
multivariate,  113-184 

operating     characteristics     of     (see    Operating 
characteristics;        Operating-characteristic 
curves;  Operating-characteristic  surface) 
for  percent  defective  (see  Variables,  for  fraction 

defective) 
sequential-sampling,  83-85,  232,  233,  423,  426 
single-sampling,  7-82,  113-184,  231,  232,  269- 

296,  377-382,  385-393 
use  of  several  acceptance  criteria  in,  50-56,  63, 
64,  113,  114 
Variability,    determining   whether   new    method, 
process,   or  product  exceeds  a    standard    in 
respect  to,  227-234,  270,  271,  274,  275,  278- 
280 
determining  whether  a  particular  one  of  two 
methods,    processes,    or    products   exceeds 
the  other  in  respect  to,  280,  282,  283,  296 
measures     of     (see    Mean     deviation;     Range; 
Standard  deviation;  Variance) 
Variance,  analysis  of,  297,  310-314,  385,  386 
(See  also  Analysis  of  variance) 
of  binomial  distribution,  397,  410 
components  of,  269,  297,  310-314 

due  to  grouping  or  rounding,  195,  218 
of  correlation  coefficient  (r),  413 
definition  of,  161,  195 

estimates  of,  12,  13,  43,  162,  198  (See  also  s2) 
X2-test  of,  231,  232,  270-280 
F-tests  of,  280-296,  311-314 
homogeneity  of,  Bartlett's  test  for,  183,  387, 

388 
largest    of   several,    Cochran's    test    for    sig- 
nificance of,  383-394 
experiments  pertaining  to,  design  and  analysis 

of,  227-234,  269-318 
of  grouped  distribution,  195 
of  interpolated  value,  219-221 
of  lot  (a2) ,  as  measure  of  lot  quality  (see  Stand- 
ard deviation,  of  lot) 
(See  also  Variance,  of  population) 
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Variance,   of   maximum  likelihood    estimator    of 
mean  of  normal  population,  when  data  are 
grouped  or  rounded,  197 
of  mean,  162,  198 

when  data  are  grouped  or  rounded,  197,  198 
of  mean  deviation,  in  normal  samples,  162,  163, 

169 
of  median,  in  normal  samples,  162 
of  normal  distribution,  161,  198 

(See    also    Standard    deviation,    of    normal 

distribution) 
bias  of  s2  as   estimator  of,   when  data  are 

grouped  or  rounded,  198-200,  203-214 
estimation  of,  from  mean  deviation,  162,  163, 
169 
from  probit  data,  342-352 
from  range,  227 

from  s2,  12,  13,  43,  162,  198-200,  203-214 
from  sensitivity  tests,  342-352 
as  parameter  of,  12,  161 

tests   of  significance  relating  to,  sequential, 
232,  233,  426 
single-sample,  231,  232,  269-296,  383-394 
of  Poisson  distribution,  411 

of   population   (a),   bias   of   s2  as  estimator  of, 
when  data  are  grouped  or  rounded,   198- 
200,  203-214 
confidence  limits  for,  198-200,  213,  214 
estimation    of,    from    mean    deviation,    162, 
163,  169 
from  probit  data,  342-352 
from  range,  227 

from  s2,  12,  13,  43,  162,  198-200,  203-214 
from  sensitivity  tests,  342-352 
importance  of  distinguishing  from  s2,  12 
tests  of  significance  relating  to,   sequential, 
232,  233,  426 
single-sample,  231,  232,  269-296,  383-394 
—  of  proportion  or  percentage,  397,  410 
of  r,  413 

of  range  (see  Standard  deviation,  of  range) 
ratio  (see  F) 

of  rectangular  distribution,  195-218 
of  s2,  in  samples  from  arbitrary,  grouped,  and 
normal  populations,  199 
from  normal  population,  formula,  199,  412 
when     data     are     grouped     or     rounded, 
formula,  199,  211 
stabilizing,  by  transformation  of  data,  general 
procedure  for,  408-410 
applications  of,  410-413 

(See  also  Inverse-sine  transformation) 


w 


Wallis,  W.  Allen,  iii,  x-xii,  6,  167n.,  184,  248,   418, 

419n. 
Weaver,  C.  L.,  318n. 
Weaver,  Warren,  v,  vii,  x 

Weights,  use  of,  in  fitting  line  by  least  squares, 
343,  344 
when  dependent  variable  is  expressed  in  probits, 
343,  344,  350 
Welch,  B.  L.,  8n.,  22,  49,  58n.,  64,  72-74,  76,   77, 
79n.,  86,  93,  99,  381,  382 
quotation  from  paper  by,  with  W.  J.  Jennett, 
49-50 
Wilenski,  H.,  46n.,  93,  255n.,  265 
Wilks,  Samuel  S.,  99,  110,  166n.,  173,  174,  184 
Wishart,  John,  218,  219n.,  220n.,  221,  222 
Wold,  Herman,  193n.,  196n.,  223 
Wolfowitz,  Jacob,  x,  xi,  52n.,  108n.,  110,  330 
Work    sheets,    for    acceptance-region    boundary, 
using  two-sided  criterion,  57 
for  k,  77-79 
for  OC  curve,  32,  80 
WPA  tables  (see  Mathematical  Tables  Project) 
Wright,  Sewall,  298 


x:  a  measurement  of  a  variable  quality  charac- 
teristic, 12 
x:  arithmetic  mean  of  sample  measurements    (see 

Sample  mean) 
x  +  ks,  12,  14,  33,  36,  47n.,  50,  53,  59,  61,   62,   64, 
66,  71,  83 
(See  also  Acceptance  criterion,    in    variables 
inspection   for   fraction    defective;    Toler- 
ance limits) 
bias    of,    as    estimator    of    n  +  ka,    in    normal 

samples,  58 
expected  or  mean  value  of,  in  normal  samples, 

58 
standard  deviation  of,  in  normal  samples,  58 
x  -  ks,  13,  47n.,  53 
X  +  Ks,  98,  99,  102 

(See  also  Tolerance  factor;  Tolerance  limits) 
|:  mean  of  a  normal  distribution,  164 


Yardsticks,  for  differences  of  means,  150 
for  71-statistics,  156 
for  variance  estimates,  156 
Yates,  Frank,  18n.,  93,  120n.,  148n.,  151n.,  152n., 
183,  234,  255,  258n.,  265,  315n.,  317rc.,  318, 
342n.,  343n.,  352,  398,  399n.,  405,  407,  415 
Yates'  correction  of  x2  for  continuity,  254,  257 


W  or  Wk\  multiple  of  a  in  terms  of  which  cumula- 
tive distribution  of  sample  range  in  samples 
of  k  (Rk)  is  given,  229,  230 

w:  width  of  grouping  or  rounding  interval,  193 
(see  Grouping  interval) 

Wald,  Abraham,  ix,  52n.,  83n.,  99,  108n.,  110, 
143n.,  147,  166n.,  184,  419n.,  429 


z:  a  normally  distributed  variable,  61,  73 
^-transformation,    of    correlation    coefficient     (r), 
413 
of  variance  ratio  (F),  317n. 
Zubin,  Joseph,  257,  258,  265,  398,  400 
Zubin-Fulcher  Item  Analyzer,  257,  258,  405,  415 
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